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REGULATING LIPID LEVELS 
VIA THE ZMAX1 OR HBM GENE 

INVENTORS : John P. Carulli, Randall D. Little, Robert R. Recker and Mark L. Johnson 

RELATED APPLICATIONS 

This application is a continuation-in-part of Application No. 09/543,771 filed April 5, 
2000 and Application No. 09/544,398 filed April 5, 2000, which are continuation-in-part 
5 applications of Application No. 09/229,319, filed January 13, 1999, which claims benefit of 
U.S. Provisional Application No. 60/071,449, filed January 13, 1998, and U.S. Provisional 
Application No. 60/105,511, filed October 23, 1998, all of which are herein incorporated by 
reference in their entirety. 

FIELD OF THE INVENTION 

10 The present invention relates generally to the field of genetics, genomics and 

molecular biology. More particularly, the invention relates to methods and materials used to 
isolate, detect and sequence a high bone mass gene and corresponding wild-type gene, and 
mutants thereof that may be involved with modulating lipid levels. The present invention 
also relates to the high bone mass gene, the corresponding wild-type gene, and mutants 

15 thereof. The genes identified in the present invention are implicated in the ontology and 
physiology of atherosclerosis, arteriosclerosis and associated diseases and conditions related 
thereto. The invention also provides nucleic acids, proteins, cloning vectors, expression 
vectors, transformed hosts, methods of developing pharmaceutical compositions, methods of- 
identifying molecules involved in arteriosclerosis and associated conditions, and methods of 

20 treating or preventing diseases associated with abnormal lipid levels. In preferred 
embodiments, the present invention is directed to methods for treating, diagnosing, 
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preventing and screening for noimal and abnormal lipid-associated conditions, including 
arteriosclerosis, cardiovascular disease and stroke. 

BACKGROUND OF THE INVENTION 

Cardiovascular disease is the number one killer in the United States, and 
atherosclerosis is the major cause of heart disease and stroke. It is widely appreciated that 
cholesterol plays an important role in atherogenesis. Normally, most cholesterol serves as a 
structural element in the walls of cells, whereas much of the rest is in transit through the 
blood or functions as the starting material for the synthesis of bile acids in the liver, steroid 
hormones in endocrine cells and vitamin D in skin. The transport of cholesterol and other 
lipids through the circulatory system is facilitated by their packaging into lipoprotein carriers. 
These spherical particles comprise protein and phospholipid shells surrounding a core of 
neutral lipid, including unesterified ("free") or esterified cholesterol and triglycerides. Risk 
for atherosclerosis increases with increasing concentrations of low density lipoprotein (LDL) 
cholesterol, whereas risk is inversely proportional to the levels of high density lipoprotein 
(HDL) cholesterol. The receptor-mediated control of plasma LDL levels has been well- 
defined, and recent studies have now provided new insights .into HDL metabolism. 

The elucidation of LDL metabolism began in 1974 by Michael Brown and Joseph 
Goldstein. In brief, the liver synthesizes a precursor lipoprotein (very low density 
lipoprotein, VLDL) that is converted during circulation to intermediate density lipoprotein 
(SDL) and then to LDL. The majority of the LDL receptors expressed in the body are on the 
surfaces of liver cells, although virtually all other tissues ("peripheral tissues") express some 
LDL receptors. After binding, the receptor-lipoprotein complex is internalized by the cells 
via coated pits and vesicles, and the entire LDL particle is delivered to lysosomes, wherein it 
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is disassembled by enzymatic hydrolysis, releasing cholesterol for subsequent cellular 
metabolism. This whole-particle uptake pathway is called "receptor-mediated endocytosis." 
Cholesterol-mediated feedback regulation of both the levels of LDL receptors and cellular 
cholesterol biosynthesis help ensure cellular cholesterol homeostasis. Genetic defects in the 
5 LDL receptor in humans results in familial hypercholesterolemia, a disease characterized by 
elevated plasma LDL cholesterol and premature atherosclerosis and heart attacks. One 
hypothesis for the deleterious effects of excess plasma LDL cholesterol is that LDL enters the 
artery wall, is chemically modified, and then is recognized by a special class of receptors 
called macrophage scavenger receptors, that mediate the cellular accumulation of the LDL 

10 cholesterol in the artery, eventually leading to the formation of an atherosclerotic lesion. 

The major lipoprotein classes include intestinally derived chylomicrons that transport 
dietary fats and cholesterol, hepatic-derived VLDL, IDL and LDL that can be atherogenic, 
and hepatic- and intestinally-derived HDL that are antiatherogenic. Apoprotein B (ApoB) is 
necessary for the secretion of chylomicrons (Apo B48) and VLDL, IDL and LDL (Apo 

15 B 1 00). Plasma levels of VLDL triglycerides are determined mainly by rates of secretion in 
LPL lipolytic activity. Plasma levels of LDL cholesterol are determined mainly by the 
secretion of Apo B100 into plasma, the efficacy with which VLDL are converted to LDL and 
by LDL receptor-mediated clearance. Regulation of HDL cholesterol levels is complex and 
is affected by rates of synthesis of its Apo proteins, rates of esterfication of free cholesterol to 

20 cholesterol ester by LCAT, levels of triglyceride-rich lipoproteins and CETP-mediated 

transfer of cholesterol esters from HDL, and clearance from plasma of HDL lipids and Apo 
proteins. 

Normal lipoprotein transport is associated with low levels of triglycerides and LDL 
cholesterol and high levels of HDL cholesterol. When lipoprotein transport is abnormal, 
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lipoprotein levels can change in ways that predispose individuals to atherosclerosis and 
arteriosclerosis (see Ginsberg, Endocrinol. Metab. Clin. North Am., 27: 503-19 (1998)). 

Several lipoprotein receptors may be involved-in cellular lipid uptake. These 
receptors include: scavenger receptors; LDL receptor-related protein/a2-macroglobulin 
5 receptor (LRP); LDL receptor; and VLDL receptor. With the exception of the LDL receptor, 
all of these receptors are expressed in atherosclerotic lesions while scavenger receptors are 
mostly expressed in macrophages, the LRP and VLDL receptors may play an important role 
in mediating lipid uptake in smooth muscle cells (Hiltunen et al, Atherosclerosis, 137 suppl.: 
S81-8 (1998)). 

L0 A major breakthrough in the pharmacologic treatment of hypercholesterolemia has 

been the development of the "statin" class of 3-hydroxy : 3-methylglutaryl-CoA reductase 
(HMG CoA reductase) inhibitory drugs. 3-Hydroxy-3-methylglutaryl-CoA reductase is the 
rate controlling enzyme in cholesterol biosynthesis, and its inhibition in the fiver stimulates 
LDL receptor expression. As a consequence, both plasma LDL cholesterol levels and the risk 
15 for atherosclerosis decrease. The discovery and analysis of the LDL receptor system has had 
a profound impact on cell biology, physiology, and medicine. 

HDL is thought to remove unesterified, or "free" cholesterol (FC) from peripheral 
tissues, after which most of the cholesterol is converted to cholesteryl ester (CE) by enzymes 
in the plasma. Subsequently, HDL cholesterol is efficiently delivered directly to the fiver and 
20 steroidogenic tissues via a selective uptake pathway and the HDL receptor, SR-BI (class B 
type I scavenger receptor) or, in some species, transferred to other lypoproteins for additional 
transport in metabolism. For additional discussion on HDL and LDL metabolism see 
Krieger, Proc. Natl. Acad. Sci. USA, 95:4077-4080, 1998. 
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Recently, a strong interest in the genetic control of peak bone mass has developed in 
the field of osteoporosis. The interest has focused mainly on candidate genes with suitable 
polymorphisms to test for association with variation in bone mass within the normal range, or 
has focused on examination of genes and gene loci associated with low bone mass in the 
5 range found in patients with osteoporosis. The vitamin D receptor locus (VDR) (Morrison et 
al, Nature, 367:284-287 (1994)), PTHgene (Howard et al,J. Clin. Endocrinol Metab., 
80:2800-2805 (1995); Johnson et al, J. Bone Miner. Res., 8:11-17 (1995); Gong et al., J. 
Bone Miner. Res., 10:S462 (1995)) and the estrogen receptor gene (Hosoi et al, J. Bone 
Miner. Res., 10:S170 (1995); Morrison et al, Nature, 367:284-287 (1994)) have figured most 
10 prominently in this work. These studies are difficult because bone mass (the phenotype) is a 
continuous, quantitative, polygenic trait, and is confounded by environmental factors such as 
nutrition, co-morbid disease, age, physical activity, and other factors. Also, this type of study 
design requires large numbers of subjects. In particular, the results of VDR studies to date 
have been confusing and contradictory (Gamero et al, J. Bone Miner. Res., 10:1283-1288 
15 (1995); Eisman et al, J. Bone. Miner. Res., 10:1289-1293 (1995); Peacock, J. Bone Miner. 
Res., 10:1294-1297 (1995)). Furthermore, the work thus far has not shed much light on the 
mechanism(s) whereby the genetic influences might exert their effect on bone mass. 

While it is well known that peak bone mass is largely determined by genetic rather 
than environmental factors, studies to determine the gene loci (and ultimately the genes) 
20 linked to variation in bone mass are difficult and expensive. Study designs which utilize the 
power of linkage analysis, e.g., sib-pair or extended family, are generally more informative 
than simple association studies, although the latter do have value. However, genetic linkage 
studies involving bone mass are hampered by two major problems. The first problem is the 
phenotype, as discussed briefly above. Bone mass is a continuous, quantitative trait, and 
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establishing a discrete phenotype is difficult. Each anatomical site for measurement may be 
influenced by several genes, many of which may be different from site to site. The second 
problem is the age component of the phenotype. By the time an individual can be identified 
as having low bone mass, there is a high probability that their parents or other members of 
5 prior generations will be deceased and therefore unavailable for study, and younger 

generations may not have even reached peak bone mass, making their phenotyping uncertain 
for genetic analysis. 

Regardless, linkage analysis can be used to find the location. of a gene causing a 
hereditary "disorder" and does not require any knowledge of the biochemical nature of the 

10 disorder, i.e., a mutated protein that is believed to cause the disorder does not need to be 
known. Traditional approaches depend on assumptions concerning the disease process that 
might implicate a known protein as a candidate to be evaluated. The genetic localization 
approach using linkage analysis can be used to first find the general chromosomal region in 
which the defective gene is located and then to gradually reduce the size of the region in 

15 order to determine the location of the specific mutated gene as precisely as possible. After 
the gene itself is discovered within the candidate region, the messenger KNA and the protein 
are identified and, along with the DNA,'are checked for mutations. 

The genetic localization approach has practical implications since the location of the 
disease can be used for prenatal diagnosis even before the altered gene that causes the disease 

20 is found. Linkage analysis can enable families, even many of those that do not have a sick 
child, to know whether they axe carriers of a disease gene and to evaluate the condition of an 
unborn child through molecular diagnosis. The transmission of a disease within families, 
then, can be used to find the defective gene. As used herein, reference to "high bone mass" 
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(HBM) is analogous to reference to a disease state, although from a practical standpoint high 
bone mass can actually help a subject avoid the disease known as osteoporosis. 

Linkage analysis is possible because of the nature of inheritance of chromosomes 
from parents to offspring. During meiosis, the two parental homologues pair to guide their 
5 proper separation to daughter cells. While they are lined up and paired, the two homologues 
exchange pieces of the chromosomes, in an event called "crossing over" or "recombination." 
The resulting chromosomes are chimeric, that is, they contain parts that originate from both 
parental homologues. The closer together two sequences are on the chromosome, the less 
likely that a recombination event will occur between them, and the more closely linked they 
10 are. In a linkage analysis experiment, two positions on the chromosomes are followed from 
one generation to the next to determine the frequency of recombination between them. In a 
study of an inherited disease, one of the chromosomal positions is marked by the disease gene 
or its normal counterpart, i.e., the inheritance of the chromosomal region can be determined 
by exarriining whether the individual displays symptoms of the disorder or not The other 
15 position is marked by a DNA sequence that shows natural variation in the population such 
that the two homologues can be distinguished based on the copy of the "marker" sequence 
that they possess. In every family, the inheritance of the genetic marker sequence is 
compared to the inheritance of the disease state. If, within a family carrying an autosomal 
dominant disorder such as high bone mass, every affected individual carries the same form of 
20 the marker and all the unaffected individuals carry at least one different form of the marker, 
there is a great probability that the disease gene and the marker are located close to each 
other. In this way, chromosomes may be systematically checked with known markers and 
compared to the disease state. The data obtained from the different families is combined, and 
analyzed together by a computer using stausticaLmethods. The result is information 
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indicating the probability of linkage between the genetic marker and the disease allowing 
different distances between them. A positive result can mean that the disease is very close to 
the marker, while a negative result indicates that it is far away on that chromosome, or on an 
entirely different chromosome. 
5 Linkage analysis is performed by typing all members of the affected family at a given 

marker locus and evaluating the co-inheritance of a particular disease state with the marker 
probe, thereby determining how often the two of them are co-inherited. The recombination 
frequency can be used as a measure of the genetic distance between two gene loci. A 
recombination frequency of 1 % is equivalent to 1 map unit, or 1 centiMorgan (cM), which is 
10 roughly equivalent to 1,000 kb of DNA. This relationship holds up to frequencies of about 
20% or20cM. 

The entire human genome is 3,300 cM long. In order to find an unknown disease 
gene within 5-10 cM of a marker locus, the whole human genome can be searched with 
roughly 330 informative marker loci spaced at approximately 10 cM intervals (Botstein et al, 

15 Am, J. Hum. Genet, 32:314-331 (1980)). The reliability of linkage results is established by 
using a number of statistical methods. The method most commonly used for the analysis of 
linkage in humans is the LOD score method (Morton, Prog T Clin. Biol. Res. 9 147:245-265 
(1984), Morton al, Am. J. Hum. Genet, 38:868-883 (1986)) which was incorporated into 
the computer program LIPED by Ott, Am. J. Hum. Genet, 28:528-529 (1976). LOD scores 

20 are the logarithm of the ratio of the likelihood that two loci are linked at a given distance to 
that they are not linked (>50 cM apart). The advantage of using logarithmic values is that 
they can be summed among families with the same disease. This becomes necessary given 
the relatively small size of human families. 
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By convention, a total LOD score greater than + 3.0 (that is, odds of linkage at the 
specified recombination frequency being 1000 times greater than odds of no linkage) is 
considered to be significant evidence for linkage at that particular recombination frequency. 
A total LOD score of less than - 2.0 (that is, odds of no linkage being 100 times greater than 

5 odds of linkage at the specified frequency) is considered to be strong evidence that the two 
loci under consideration are not linked at that particular recombination frequency. Until 
recently, most linkage analyses have been performed on the basis of two-point data, which is 
the relationship between the disorder under consideration and a particular genetic marker. 
However, as a result of the rapid advances in mapping the human genome over the last few 

10 years, and concomitant improvements in computer methodology, it has become feasible to 
carry out linkage analyses using multi-point data. Multi-point analysis provide a 
simultaneous analysis of linkage between the disease and several linked genetic markers, 
when the recombination distance among the markers is known. 

Multi-point analysis is advantageous for two reasons. First, the informativeness of the 

15 pedigree is usually increased. Each pedigree has a certain amount of potential information, 
dependent on the number of parents heterozygous for the marker loci and the number of 
affected individuals in the family. However, few markers are sufficiently polymorphic as to 
be informative in all those individuals. If multiple markers are considered simultaneously, 
then the probability of an individual being heterozygous for at least one of the markers is 

20 greatly increased. Second, an indication of the position of the disease gene among the 
markers may be determined. This allows identification of flanking markers, and thus 
eventually allows isolation of a small region in which the disease gene resides. Lathrop et 
al. , Proc. Natl. Acad. Sri. USA, 8 1 :3443-3446 (1984) have written the most widely used 
computer package, LINKAGE, for multi-point analysis. 
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There is a need in the art for identifying the gene associated with a high bone mass 
phenotype. The present invention is directed to this, as well as other, important ends. 

SUMMARY OF THE INVENTION 

The present invention describes the Zmaxl gene and the HBM gene on chromosome 

5 1 1 ql 3 .3 by genetic linkage and mutation analysis. The use of additional genetic markers 
linked to the genes has aided this discovery. By using linkage analysis and mutation analysis, 
persons predisposed to lipid associated disorders may be readily identified. Cloning methods 
using Bacterial Artificial Chromosomes have enabled the inventors to focus on the 
chromosome region of 1 lql3.3 and to accelerate the sequencing of the autosomal dominant 

10 gene. In addition, the invention identifies the Zmaxl gene and the HBM gene, and identifies 
the guanine-to-thymine polymorphism mutation at position 582 in the Zmaxl gene that 
produces the HBM gene and the HBM phenotype as well as altered lipid levels. 

The present invention identifies the Zmaxl gene and the HBM gene, which can be 
used to determine if people are predisposed to abnormal lipid levels and, therefore, 

15 susceptible to diseases mediated by lipids, including, for example, atherosclerosis, 

arteriosclerosis and associated conditions. Individuals with the HBM gene have lower LDL, 
triglyceride and VLDL levels and higher HDL levels. In other words, the HBM gene is a 
suppressor of atherosclerosis, arteriosclerosis and associated conditions. This in vivo 
observation is a strong evidence that treatment of normal individuals with the HBM gene or 

20 protein, or fragments thereof, will ameliorate atherosclerosis, arteriosclerosis and conditions 
related thereto. 

Moreover, such treatment will be indicated in the treatment of lipid-mediated 
diseases, particularly arteriosclerosis and conditions related thereto. For example, persons 

-10- 
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predisposed to elevated lipid levels (i.e., diabetes, hypercholesteremia and other genetic 
diseases, obesity, male gender, and individuals who smoke) may be identified and/or treated 
by means of the invention. Moreover, the methods and compositions of the invention will be 
of use in the treatment or prevention of diabetic atherosclerotic disease, neurovascular 
5 conditions caused by plaque build-up {e.g., stroke), cardiovascular disease, poor circulation 
due to plaque build-up ad associated poor would healing. 

In various embodiments, the present invention is directed to nucleic acids, proteins, 
" vectors, and transformed hosts of HBM and Zmaxl . 

Additionally, the present invention is directed to applications of the above 
10 embodiments of the invention including, for example, gene therapy, pharmaceutical 
development, and diagnostic assays for bone development disorders. In preferred 
embodiments, the present invention is directed to methods for treating, diagnosing, 
preventing and screening for osteoporosis. 

These and other aspects of the present invention are described in more detail below. 

15 ttRTFF DESCKTPTTON O F TTTR FIGURES 

Fig. 1 shows the pedigree of the individuals used in. the genetic linkage studies. 

Under each individual is an ID number, the z-score for spinal BMD, and the allele calls for 

the critical markers on chromosome 11. Solid symbols represent "affected" individuals. 

Symbols containing "N" are "unaffected" individuals. DNA from 37 individuals was 
20 genotyped. Question marks denote unknown genotypes or individuals who were not 

genotyped. 

Fig. 2 depicts the BAC/STS content physical map of the HBM region in llql3.3. 
STS markers derived from genes, ESTs, microsatellites, random sequences, and B AC 



-11- 



BNSDOCID: <WO 0192B91 A2_1_> 



WO 01/92891 



PCTAJS01/16946 



endsequences are denoted above the long horizontal line. For markers that are present in 
GDB the same nomenclature has been used. Locus names (D11S####) are listed in 
parentheses after the primary name if available. STSs derived from BAC endsequences are 
listed with the BAC name first followed by L or R for the left and right end of the clone, 
5 respectively. The two large arrows indicate the genetic markers that define the HBM critical 
region. The horizontal lines below the STSs indicate BAC clones identified by PCR-based 
screening of a nine-fold coverage BAC library. Open circles indicate that the marker did not 
amplify the corresponding BAC library address during library screening. Clone names use 
the following convention: B for BAC, the plate, row and column address, followed by -H 
10 indicating the HBM project (i.e., B36F16-H). 

Figs. 3A-3F show the genomic structure of Zmaxl with flanking intron sequences. 
Translation is initiated by the underlined "ATG" in exon 1 . The site of the polymorphism in 
the HBM gene is in exon 3 and is represented by the underlined »G," whereby this nucleotide 
is a "T» in the HBM gene. The 3' untranslated region of the mRNA is underlined within exon 
15 23 (exon 1, SEQ ID NO:40; exon 2, SEQ ID NO:41; exon 3, SEQ ID NO:42; exon 4, SEQ 
ID NO:43; exon 5, SEQ ID NO:44; exon 6, SEQ ID NO:45; exon 7, SEQ ID NO:46; exon 8, 
SEQ ID NO:47; exon 9, SEQ ID NO:48; exon 10, SEQ ID .NO:49; exon 11, SEQ ID NO:50; 
exon 12, SEQ ID NO:51; exon 13, SEQ ID NO:52; exon 14, SEQ ID NO:53; exon 15, SEQ 
ID NO:54; exon 16, SEQ ID NO:55; exon 17, SEQ ID NO:56; exon 18, SEQ ID NO:57; 
20 exon 19, SEQ ID NO:58; exon 20, SEQ ID NO:59; exon 21, SEQ ID NO:60; exon 22, SEQ 
ID NO:61; and exon 23; SEQ ID NO:62). 

Fig. 4 shows the domain organization of Zmaxl, including the YWTD spacers, the 
extracellular attachment site, the binding site for LDL and calcium, the cysteine-rich growth 
factor repeats, the transmembrane region, the ideal PEST region with the CK-H 



12- 



BNSDOCID: <WO 0192B91A2 t > 



WO 01792891 



PCT/US01/16946 



phosphorylation site and the internalization domain. Fig. 4 also shows the site of the glycine 
to valine change that occurs in the HBM protein. The signal peptide is located at amino acids 
1-22, the extracellular domain is located at amino acids 23-1385, the transmembrane segment 
is located at amino acids 1386-1413, and the cytoplasmic domain is located at amino acids 
5 1414-1615. 

Fig. 5 is a schematic illustration of the BAC contigs B527D12 and B200E21 in 
• relation to the HBM gene. 

Figs. 6A-6E are the nucleotide and amino acid sequences of the wild-type gene, 
Zmaxl. The location for the base pair substitution at nucleotide 582, a guanine to myrnine, is 
10 underlined. This allelic variant is the HBM gene. The HBM gene encodes for a protein with 
an amino acid substitution of glycine to valine at position 171. The 5' untranslated region 
(UTR) boundaries bases 1 to 70, and the 3' UTR boundaries bases 4916-5120. 

Figs. 7 A and 7B are northern blot analyses showing the expression of Zmaxl in 

various tissues. 
15 Fig. 8 is a PCR product analysis. 

Fig. 9 is allele specific oligonucleotide detection of the Zmaxl exon 3 mutation. 
Fig. 10 is the cellular localization of mouse Zmaxlby in situ hybridization at 100X 
magnification using sense and antisense probes. 

Fig. 11 is the cellular localization of mouse Zmaxl by in situ hybridization at 400X 
20 magnification using sense and antisense probes. 

Fig. 12 is the cellular localization of mouse Zmaxl by in situ hybridization of 
osteoblasts in the endosteum at 400X magnification using sense and antisense probes. 
Fig. 13 shows antisense inhibition of Zmaxl expression in MC-3T3 cells. 



-13- 



BNSDOC1D: <WO 0 1 S2S9 1 A2J_> 



WO 01/92891 



PCT/US01/16946 



TbTF.TATT .TCP py.sraiPHQ ™ oy TT ™- INVE1STIQM 

To aid in the understanding of the specification and claims, the following definitions 

are provided. 

"Gene" refers to a DNA sequence that encodes through its template or messenger 
5 RNA a sequence of amino acids characteristic of a specific peptide. The term "gene- 
includes intervening, non-coding regions, as well as regulatory regions, and can include 5» 
and 3' ends. 

"Gene sequence" refers to a DNA molecule, including hoth a DNA molecule which 
contains a non-transcribed or non-translated sequence. The term is also intended to include 
10 any combination of gene(s), gene fragments), non-transcribed sequence^) or non-translated 
sequence(s) which, are present on the same DNA molecule. 

The sequences of the present invention may be derived from a variety of sources 
including DNA, cDNA, synthetic DNA, synthetic RNA or combinations thereof. Such 
sequences may comprise genomic DNA which may or may not include naturally occurring 
15 introns. Moreover, such genomic DNA may be obtained in association with promoter 

regions or poly (A) sequences. The sequences, genomic DNA or cDNA maybe obtained in 
any of several ways/ Genomic DNA can be extracted and purified from suitable cells by 
means well known in the art Alternatively, mRNA can be isolated from a cell and used to 
produce cDNA by reverse transcription or other means. 

» cDNA" refers to complementary or copy DNA produced from an RNA template by 
the action of RNA-dependent DNA polymerase (reverse transcriptase). Tims, a "cDNA 
clone" means a duplex DNA sequence complementary to an RNA molecule of interest, 
carried in a cloning vector or PCR amplified. This term includes genes from which the 
intervening sequences have been removed. 



20 
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"Recombinant DNA" means a molecule that has been recombined by in vitro splicing 
cDNA or a genomic DNA sequence. 

"Cloning" refers to the use of in vitro recombination techniques to insert a particular 
gene or other DNA sequence into a vector molecule. In order to successfully clone a desired 
5 gene, it is necessary to use methods for generating DNA fragments, for joining the fragments 
to vector molecules, for introducing the composite DNA molecule into a host cell in which it 
can replicate, and for selecting the clone having the target gene from amongst the recipient 
host cells. 

"cDNA library" refers to a collection of recombinant DNA molecules containing 
10 cDNA inserts which together comprise the entire genome of an organism. Such a cDNA 
library can be prepared by methods known to one skilled in the art and described by, for 
example, Cowell and Austin, "cDNA Library Protocols," Methods in Molecular Biology 
(1997). Generally, RNA is first isolated from the cells of an organism from whose genome it 
is desired to clone a particular gene. 
15 "Cloning vehicle" refers to a plasmid or phage DNA or other DNA sequence which is 

able to replicate in a host cell. The cloning vehicle is characterized by one or more 
endonuclease recognition sites at which such DNA sequences may be cut in a determinable 
fashion without loss of an essential biological function of the DNA, which may contain a 
marker suitable for use in the identification of transformed cells. 
20 "Expression control sequence" refers to a sequence of nucleotides that control or 

regulate expression of structural genes when operably linked to those genes. These include, 
for example, the lac systems, the tip system, major operator and promoter regions of the 
phage lambda, the control region of fd coat protein" and other-sequences known to control the 
expression of genes in prokaryotic or eukaryotic cells. Expression control sequences will 
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vary depending on whether the vector is designed to express the operably linked gene in a 
prolcaryotic or eukaryotic host, and may contain transcriptional elements such as enhancer 
elements, termination sequences, tissue-specificity elements and/or translational initiation and 
termination sites. 

5 "Expression vehicle" refers to a vehicle or vector similar to a cloning vehicle but 

which is capable of expressing a gene which has been cloned into it, after transformation into 
a host. The cloned gene is usually placed under the control of (i.e., operably linked to) an 
expression control sequence. 

"Operator" refers to a DNA sequence capable of interacting with the specific 
10 repressor, thereby controlling the transcription of adjacent gene(s). 

"Promoter" refers to a DNA sequence* that can be recognized by "an RNA polymerase. 
The presence of such a sequence permits the RNA polymerase to bind and initiate 
transcription of operably linked gene sequences. 

"Promoter region" is intended to include the promoter as well as other gene sequences 
15 which may be necessary for the initiation of transcription. The presence of a promoter region 
is sufficient to cause the expression of an operably linked gene sequence. 

"Operably linked" means that the promoter controls the initiation of expression of the 
gene. A promoter is operably linked to a sequence of proximal DNA if upon introduction 
into a host cell the promoter determines the transcription of the proximal DNA sequence(s) 
20 into one or more species of RNA. A promoter is operably linked to a DNA sequence if the 
promoter is capable of initiating transcription of that DNA sequence. 

"Prokaryote" refers to all organisms without a true nucleus, including bacteria. 

"Eukaryote" refers to organisms and cells that have a true nucleus, including 
mammalian cells. 
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"Host" includes prokaryotes and eukaiyotes, such as yeast and filamentous fungi, as 
well as plant and animal cells. The term includes an organism or cell that is the recipient of a 
replicable expression vehicle. 

By "animal" is meant to include vertebrates. Preferred vertebrates include mammals 
5 and birds, but also include fish, reptiles and amphibians. Preferred mammals include: 
humans, primates, rodents, canines, felines and livestock. 

"Fragment" of a gene refers to any variant of the gene that possesses the biological 

activity of that gene. 

"Variant" refers to a gene that is substantially similar in structure and biological 

10 activity or immunological characteristics to either the entire gene or to a fragment of the 
gene. Provided that the two genes possess a similar activity, they are considered variant as 
that term is used herein even if the sequence of amino acid residues is not identical. 

"Amplification of nucleic acids" refers to methods such as polymerase chain reaction 
(PCR), ligation amplification (or ligase chain reaction, LCR) and amplification methods 

15 based on the use of Q-beta replicase. These methods are well known in the art and described, 
for example, in U.S. Patent Nos. 4,683,195 and 4,683,202. Reagents and hardware for 
conducting PCR are commercially available: Primers useful for amplifying sequences from 
the HBM region are preferably complementary to, and hybridize specifically to sequences in 
the HBM region or in regions that flank a target region therein. HBM sequences generated 

20 by amplification may be sequenced directly. Alternatively, the amplified sequence(s) may be 
cloned prior to sequence analysis. 

"Antibodies" may refer to polyclonal and/or monoclonal antibodies and fragments 
thereof, and immunologic binding equivalents thereof, that can bind to the HBM and Zmaxl 
proteins and fragments thereof or to nucleic acid sequences from the HBM or Zmaxl region, 
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particularly from the HBM locus or a portion thereof. The term antibody is used both to refer 
to a homogeneous molecular entity, or a mixture such as a serum product made up of a 
plurality of different molecular entities. Proteins may be prepared synthetically in a protein 
synthesizer and coupled to a carrier molecule and injected over several months into rabbits. 
5 Rabbit sera is tested for immunoreactivity to the HBM protein or fragment. Monoclonal 
antibodies may be made by injecting mice with the proteins, or fragments thereof. ' 
Monoclonal antibodies will be screened by ELISA and tested for specific immunoreactivity 
with HBM protein or fragments thereof. Harlow et aL, Antibodies: A Laboratory Manual, 
Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1988). These antibodies will be 
10 useful in assays as well as pharmaceuticals. Antibodies can include antibody fragments {e.g., 
scFv, Fab, F(ab') 2 > etc.) as well as human antibodies, humanized antibodies and primatized 
antibodies. 

"HBM" refers to high bone mass, but polymorphisms associated with .HBM gene, 
which can also be involved in lipid modulation. 

15 "HBM protein" refers to a protein that is identical to a Zmaxl protein except that it 

contains an alteration of glycine 171 to valine. An HBM protein is defined for any organism 
that encodes a Zmaxl true homologue. For example, a mouse HBM protein refers to the 
mouse Zmaxl protein having .the glycine 170 to valine substitution. 

"iZBMgene" refers to the genomic DNA sequence found in individuals showing the 

20 HBM characteristic or phenotype, where the sequence encodes the protein indicated by SEQ 
ID NO: 4. The HBM gene and the Zmaxl gene are allelic. The protein encoded by the HBM 
gene has the property of causing elevated bone mass and also altering physiologic lipid 
levels, while the protein encoded by the Zmaxl gene does not. The HBM gene and the Zmaxl 
gene differ in that the HBM gene has a thymine at position 582, while the Zmaxl gene has a 
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guanine at position 582. The HSMgene comprises the nucleic acid sequence shown as SEQ 
ID NO: 2. The HBM gene may also be referred to as an' "HBM polymorphism." 

"Normal," "wild-type," "unaffected" and "Zmaxl " all refer to the genomic DNA 
sequence that encodes the protein indicated by SEQ ID NO: 3. The Zmaxl gene has a 
5 guanine at position 582. The Zmaxl gene comprises the nucleic acid sequence shown as SEQ 
ED NO: 1. "Normal," "wild-type," "unaffected" and "Zmaxl" also refer to allelic variants of 
the genomic sequence that encodes proteins that do not contribute to elevated bone mass. 
The Zmaxl gene is common in the human population, while the HBM gene is rare. 

"5YWT+EGF" refers to a repeat unit found in the Zmaxl protein, consisting of five 
10 YWT repeats followed by an EGF repeat. 

"Bone development" generally refers to any process involved in the change of bone 
over time, mcluding, for example, normal development, changes that occur during disease 
states, and changes that occur during aging. "Bone development disorder" particularly refers 
to any disorders in bone development including, for example, changes that occur during 
15 disease states and changes that occur during aging. Bone development may be progressive or 
cyclical in nature. Aspects of bone that may change during development include, for 
example, mineralization, formation of specific anatomical features, and relative or absolute 
numbers of various cell types. 

"Bone modulation" or "modulation of bone formation" refers to the ability to affect 
20 any of the physiological processes involved in bone remodeling, as will be appreciated by one 
skilled in the art, including, for example, bone resorption and appositional bone growth, by, 
inter alia, osteoclastic and osteoblastic activity, and may comprise some or all of bone 
formation and development as used herein. 
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"Normal bone density" refers to a bone density within two standard aeviauons 01 a z, 
score of 0. 

By "lipid regulation" or "lipid modulation" is meant the ability to alter by modulating 
thoHSMovZmaxJ genes, mRNA or protein encoded thereby the levels of a lipid. Altered 
5 levels of lipid include very low density lipoproteins (VLDL), low density lipoproteins (LDL), 
high density lipoprotein (HDL) and triglycerides. Tne regulation or modulation can be an 
increase or decrease in the lipid level by an agent, which when administered to a subject 
modulates HBM or Zmaxl activity. By "lipid metabolism" is meant the physiological cycle . 
through which the various triglycerides and lipoproteins proceed. Agents of the invention 
10 can also be said to modulate the metabolism of various lipids. 

"Lipid" preferably includes very low density lipoproteins (VLDL), low density 
lipoproteins (LDL), intermediate density lipoprotein (IDL), high density lipoprotein (HDL) 
and triglycerides. Lipids can also include apolipoproteins, such as apolipoprotein A-l (APO 
A-l), apolipoprotein B (APO B), apohprotein E (APO E) and lipoproteins such as lipoprotein 
15 a(LIPOa). ' 

By "lipid-mediated disease or condition" is meant to include arteriosclerosis and 
related conditions, hypercholesteremia, hyperhpidemia, atherosclerosis, and conditions or 
lifestyles associated with elevated lipid levels (e.g.. diabetes mellitus, smoking and obesity) 
such as those discussed herein. 
20 By "arteriosclerosis" is meant to include hypertrophy of the media and subintimal 

fibrosis with hyaline degeneration which can result in ectasia, aneurysm, increased systolic 
pressure, thrombus formation and embolism. Disorders associated with arteriosclerosis 
include, but are not limited to, nonatheromatous arteriosclerosis conditions such as: diabetes 
mellitus, chronic renal insufficiency, chronic vitamin D intoxication, pseudoxanthoma 
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elasticum 3 idiopatlric arterial calcification in infancy, aortic valvular calcification in the 
elderly, and Werner's syndrome. Additional disorders associated with arteriosclerosis and 
atherosclerosis include: diabetes mellitus, hypertension, familial hypercholesterolemia, 
familial combined hyperlipidemia, familial dysbetalipoproteinemia, familial 
5 hypoalphalipoproteinemia, hypothyroidism, cholesterol ester storage disease, systemic lupus 
erythematosus and homocysteinemia. 

By "atherosclerosis" is meant patchy intramural thickening of the subintima that 
encroaches on the arterial lumen and can cause obstruction. Atherosclerotic plaque consists 
of the accumulation of lipids, cells, annective tissue and glycosaminoglycans. It can cause 
10 the following conditions: stenosis, thrombosis, aneurysm, or embolus supervenes, as well as 
angina as well as the conditions listed above. 

A "Zmaxl system" refers to a purified protein, cell extract, cell, animal, human or any 
other composition of matter in which Zmaxl is present in a normal or mutant form. 

A "surrogate marker" refers to a diagnostic indication, symptom, sign or other feature 
15 that can be observed in a cell, tissue, human or animal that is correlated with the HBM gene 
or elevated bone mass or both, but that is easier to measure than bone density. The general 
concept of a surrogate marker is well accepted in diagnostic. medicine. 

The present invention encompasses the Zmaxl gene and Zmaxl protein in the forms 
indicated by SEQ ID NOS: 1 and 3, respectively, and other closely related variants, as well as 
20 the adjacent chromosomal regions of Zmaxl necessary for its accurate expression In a 

preferred embodiment, the present invention is directed to at least 15 contiguous nucleotides 
of the nucleic acid sequence of SEQ JD NO: 1 . 

The present invention also encompasses the HBM gene and HBM protein in the forms 
indicated by SEQ ED NO: 2 and 4, respectively, and other closely related variants, as well as 
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the adjacent chromosomal regions of the HBM gene necessary for its accurate expression. In 
a preferred embodiment, the present invention is directed to at least 15 contiguous 
nucleotides of the nucleic acid sequence of SEQ ID NO: 2. More preferably, the present 
invention is directed to at least 15 contiguous nucleotides of the nucleic acid sequence of 
5 SEQ ID NO: 2, wherein one of the 15 contiguous nucleotides is the thymine at nucleotide 
582. 

The invention also relates to the nucleotide sequence of the Zmaxl gene region, as 
well as the nucleotide sequence of the HBM gene region. More particularly, a preferred 
embodiment are the BAG clones containing segments of the Zmaxl gene region B200E21-H 

10 and B527D 12-H. A preferred embodiment is the nucleotide sequence of the BAC clones 
consisting of SEQ ID NOS: 5-12. 

The invention also concerns the use of the nucleotide sequence to identify DNA 
probes for the Zmaxl gene and the HBM gene, PCR primers to amplify the Zmaxl gene and 
the HBM gene, nucleotide polymorphisms in the Zmaxl gene and the HBM gene, and - 

15 regulatory elements of the Zmaxl gene and the HBM gene. 

This invention describes the further localization of the chromosomal location of the 
Zmaxl gene and HBM gene on chromosome llql3.3 between genetic markers D11S987 and 
SNP_CONTIG033-6, as well as the DNA sequences of the Zmaxl gene and the HBM gene. 
The chromosomal location was refined by the addition of more genetic markers to the 

20 mapping panel used to map the gene, and by the extension of the pedigree to include more 
individuals. The pedigree extension was critical because the new individuals that have been 
genotyped harbor critical recombination events that narrow the region. To identify genes in 
the region on 1 lql3 .3, a set of BAC clones containing this chromosomal region was 
identified. The BAC clones served as a template for genomic DNA sequencing, and also as a 
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reagent for identifying coding sequences by direct cDNA selection. Genomic sequencing and 
direct cDNA selection were used to characterize more than 1 .5 million base pairs of DNA 
from 1 lql3.3. The Zmaxl gene was identified within this region and the HBM gene was then 
discovered after mutational analysis of affected and unaffected individuals. 
5 When a gene has been genetically localized to a specific chromosomal region, the 

genes in this region can be characterized at the molecular level by a series of steps that 
include: cloning of the entire region of DNA in a set of overlapping clones (physical 
mapping), characterization of genes encoded by these clones by a combination of direct 
cDNA selection, exon trapping and DNA sequencing (gene identification), and identification 
10 of mutations in these genes by comparative DNA sequencing of affected and unaffected 
members of the HBM kindred (mutation analysis). 

Physical mapping is accomplished by screening libraries of human DNA cloned in 
vectors that are propagated in E. coli or S. cereviseae using PCR assays designed to amplify 
unique molecular landmarks in the chromosomal region of interest. To generate a physical 
15 - map of the HBM candidate region, a library of human DNA cloned in Bacterial Artificial 
Chromosomes (BACs) was screened with a set of Sequence Tagged Site (STS) markers that 
had been previously mapped to chromosome 1 Iql2-ql3 by.the efforts of the Human Genome 
Project. 

STSs are unique molecular landmarks in the human genome that can be assayed by 
20 PCR- Through the combined efforts of the Human Genome Project, the location of thousands 
of STSs on the twenty-two autosomes and two sex chromosomes has been determined. For a 
positional cloning effort, the physical map is tied to the genetic map because the markers 
used for genetic mapping can also be used as STSs for physical mapping. By screening a 
BAC library with a combination of STSs derived from genetic markers, genes, and random 
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DNA fragments, a physical map comprised of overlapping clones representing all of the 
DNA in a chromosomal region of interest can be assembled. 

BACs are cloning vectors for large (80 kilobase to 200 kilobase) segments of human 
or other DNA that are propagated in E. colt To construct a physical map using BACs, a 
5 library of BAC clones is screened so that individual clones harboring the DNA sequence 
corresponding to a given STS or set of STSs are identified. Throughout most of the human 
genome, the STS markers are spaced approximately 20 to 50 kilobases apart, so that an 
individual BAC clone typically contains at least two STS markers. In addition, the BAC 
libraries that were screened contain enough cloned DNA to cover the human genome six 
10 times over. Therefore, an individual STS typically identifies more than one BAC clone. By 
screening a six-fold coverage BAC library with a series of STS markers spaced 
approximately 50 kilobases apart, a physical map consisting of a series of overlapping BAC 
clones, i.e. BAC contigs, can be assembled for any region of the human genome. This map is 
closely tied to the genetic map because many of the STS markers used to prepare the physical 
15 map are also genetic markers. 

When constructing a physical map, it often happens that there are gaps in the STS 
map of the genome that result in the inability to identify BAC clones that are overlapping in a 
given location. Typically, the physical map is first constructed from a set of STSs that have 
been identified through the publicly available literature and World Wide Web resources. The 
20 initial map' consists of several separate BAC contigs that are separated by gaps of unknown 
molecular distance. To identify BAC clones that fill these gaps, it is necessary to develop 
new STS markers from the ends of the clones on either side of the gap. This is done by 
sequencing the terminal 200 to 300 base pairs of the BACs flanking the gap, and developing 
a PCR assay to amplify a sequence of 100 or more base pairs. If the terminal sequences are 
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demonstrated to be unique within the human genome, then the new STS can be used to screen 
the BAC library to identify additional BACs that contain the DNA from the gap in the 
physical map. To assemble a BAC contig that covers a region the size of the HBM candidate 
region (2,000,000 or more base pairs), it is often necessary to develop new STS markers from 
5 the ends of several clones. 

After building a BAC contig, this set of overlapping clones serves as a template for 
• identifying the genes encoded in the chromosomal region. Gene identification can be 
accomplished by many methods. Three methods are commonly used: (1) a set of BACs 
selected from the BAC contig to represent the entire chromosomal region can be sequenced, 
10 and computational methods can be used to identify all of the genes, (2) the BACs from the 
BAC contig can be used as a reagent to clone cDNAs corresponding to the genes encoded in 
the region by a method termed direct cDNA selection, or (3) the BACs from the BAC contig 
can be used to identify coding sequences by selecting for specific DNA sequence motifs in a 
procedure called exon trapping. The present invention includes genes identified by the first 
15 two methods. 

To sequence the entire BAC contig representing the HBM candidate region, a set of 
BACs was chosen for subcloning into plasmid vectors andsubsequent DNA sequencing of 
these subclones. Since the DNA cloned in the BACs represents genomic DNA, this 
sequencing is referred to as genomic sequencing to distinguish it from cDNA sequencing. To 
20 initiate the genomic sequencing for a chromosomal region of interest, several non- 
overlapping BAC clones are chosen. DNA for each BAC clone is prepared, and the clones 
are sheared into random small fragments which are subsequently cloned into standard 
plasmid vectors such as pUC18. The plasmid clones are then grown to propagate the smaller 
fragments, and these are the templates for sequencing. To ensure adequate coverage and 
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sequence quality for the BAC DNA sequence, sufficient plasmid clones are sequenced to 
yield six-fold coverage of the BAC clone. For example, if the BAC is 100 kilobases long, 
' then phagemids are sequenced to yield 600 kilobases of sequence. Since the BAC DNA was 
randomly sheared prior to cloning in the phagemid vector, the 600 kilobases of raw DNA 
5 sequence can be assembled by computational methods into overlapping DNA sequences 
termed sequence contigs. For the purposes of initial gene identification by computational 
methods, six-fold coverage of each BAC is sufficient to yield ten to twenty sequence contigs 
of 1000 base pairs to 20,000 base pairs. 

The sequencing strategy employed in this invention was to initially sequence "seed" 

10 BACs from the BAC contig in the HBM candidate region. The sequence of the "seed" BACs 
was then used to identify minimally overlapping BACs from the contig, and these were 
subsequently sequenced. In this manner, the entire candidate region was sequenced, with 
several small sequence gaps left in each BAC. This sequence served as the template for 
computational gene identification. One method for computational gene identification is to 

15 compare the sequence of BAC contig to publicly available databases of cDNA and genomic 
sequences, e.g. unigene, dbEST, genbank. These comparisons are typically done using the 
# BLAST family of computer algorithms and programs (Altschul et al, J. Mol Biol, 215:403- 
410 (1990)). The BAC sequence can also be translated into protein sequence, and the protein 
sequence can be used to search publicly available protein databases, using a version of 

20 BLAST designed to analyze protein sequences (Altschul et al, Nucl Acids Res., 25:3389- 
3402 (1997)). Another method is to use computer algorithms such as MZEF (Zhang, Proc. 
Natl Acad. Set, 94:565-568 (1997)) and GRAIL (Ubefbacher et al, Methods Enzymol, 
266:259-28 1 (1996)), which predict the location of exons in the sequence based on the 
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presence of specific DNA sequence mohts that are common to all exons, as well as the 
presence of codon usage typical of human protein encoding sequences. 

In addition to identifying genes by computational methods, genes were also identified 
by direct cDNA selection (Del Mastro et aL, Genome Res. 5(2): 185-194 (1995)). Indirect 

5 cDNA selection, cDNA pools from tissues of interest are prepared, and the BACs from the 
candidate region are used in a liquid hybridization assay to capture the cDNAs which base 
pair to coding regions in the BAC. In the methods described herein, the cDNA pools were 
created from several different tissues by random priming the first strand cDNA from polyA 
RNA, synthesizing the second strand cDNAby standard methods, and adding linkers to the 

10 ends of the cDNA fragments. The linkers are used to amplify the cDNA pools. The BAC 
clones are used as a template for in vitro DNA synthesis to create a biotin labelled copy of the 
BAC DNA The biotin labelled copy of the BAC DNA is then denatured and incubated with 
an excess of the PCR amplified, linkered cDNA pools which have also been denatured. The 
BAC DNA and cDNA are allowed to anneal in solution, and hetero duplexes between the 

15 BAC and the cDNA are isolated using streptavidin coated magnetic beads. The cDNAs that 
are captured by the BAC are then amplified using primers complimentary to the linker 
sequences, and the hybridization/selection process is repeated for a second round. After two 
rounds of direct cDNA selection, the cDNA fragments are cloned, and a library of these 
direct selected fragments is created. 

20 The cDNA clones isolated by direct selection are analyzed by two methods. Since a 

pool of BACs from the HBM candidate region is used to provide the genomic DNA 
sequence, the cDNAs must be mapped to individual BACs. This is accomplished by arraying 
the BACs in microtiter dishes', and replicating their DNA in high density grids. Individual 
cDNA clones are then hybridized to the grid to confirm that they have sequence identity to an 
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individual BAC from the set used for direct selection, and to determine the specific identity 
of that BAC. cDNA clones that are confirmed to correspond to individual B ACs are 
sequenced. To determine whether the cDNA clones isolated by direct selection share 
sequence identity or similarity to previously identified genes, the DNA and protein coding 
5 sequences are compared to publicly available databases using the BLAST family of 
programs. 

The combination of genomic DNA sequence and cDNA sequence provided by BAC 
sequencing and by direct cDNA selection yields an initial list of putative genes in the region. 
The genes in the region were aU candidates for the HBM locus. To further characterize each 
10 gene, Northern blots were performed to determine the size of the transcript corresponding to 
each gene, and to determine which putative exons were transcribed together to make an 
individual gene. For Northern blot analysis of each gene, probes were prepared from direct 
selected cDNA clones or by PCR ampnfying specific fragments from genomic DNA or from 
the BAC encoding the putative gene of interest. The Northern blots gave information on the 
15 size of the transcript and the tissues in which it was expressed. For transcripts which, were 
not highly expressed, it was sometimes necessary to perform a reverse transcription PCR 
assay using RNA from the tissues of interest as a templatejpr the reaction. 

. Gene identification by computational methods and by direct cDNA selection provides 
unique information about the genes in a region of a chromosome. When genes are identified, 
20 then it is possible to examine different individuals for mutations in each gene. 

I. Phenotyping using DXA Measurements 

Spinal bone mineral content (BMC) and bone mineral density (BMD) measurements 
performed at Creighton University (Omaha, Nebraska) were made by DXA using a Norland 
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Instruments densitometer (Norland XR2600 Densitometer, Dual Energy X-ray 
Absorptiometry, DXA). Spinal BMC and BMD at other locations used the machinery 
available. There are estimated to be 800 DXA machines currently operating in the U.S. Most 
larger cities have offices or imaging centers which have DXA capabilities, usually a Lunar or 
5 Hologic machine. Each location that provided spine BMC and BMD data included copies of 
the printouts from their machines to provide verification that the regions of interest for 
measurement of BMD have been chosen appropriately. Complete clinical histories and 

skeletal radiographs were obtained. 

The HBM phenotype is defined by the following criteria: very high spinal BMD; a 
10 clinical history devoid of any known high bone mass syndrome; and skeletal radiographs 
showing a normal shape of the appendicular skeleton. 

n. Genotyping of Microsatellite Markers 

To narrow the genetic interval to a region smaller than that originally reported by 
Johnson et al, Am. J. Hum. Genet., 60:1326-1332 (1997), additional microsatellite markers 
15 on chromosome llql2-13 were typed. The new markers included: D11S4191.D11S1883, 
D11S1785,D11S4113,D11S4136,D11S4139, (Dib, et a^Nature, 380:152-154 (1996), 
FGF3 (Tolymeropolous, et al, Nucl Acid Res., 18:7468 (1990)), as well as 
GTC_HBM_Marker_l, GTC_HBM_Marker_2, GTC_HBM_Marker_3 , 
GTC_HBM_Marker_4, GTC_HBM_Marker_5, GTC_HBM_Marker_6, and 
20 GTC_HBM_Marker_7 (See Fig. 2). 

Blood (20 ml) was drawn into lavender cap (EDTA containing) tubes by a certified 
phlebotomist. The blood was stored refrigerated until DNA extraction. DNA has been 
extracted from blood stored for up to 7 days in the refrigerator without reduction in the 
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quality or quantity of yield. .For those subjects that have blood drawn at distant sites, a 
shipping protocol was successfully used on more than a dozen occasions. Blood samples 
were snipped by overnight express in a styrofoam container with freezer packs to provide 
cooling. Lavender cap tubes were placed on individual plastic shipping tubes and then into 
5 "zip-lock" biohazard bags. When the samples arrived the next day, they were immediately 
processed to extract DNA. 

The DNA extraction procedure used a kit purchased from Gentra Systems, Inc. 
(Minneapolis, Minnesota). Briefly, the procedure involved adding 3 volumes of a red blood 
cell lysis buffer to the whole blood. After incubations for 10 minutes at room temperature, 
10 the solution was centrifuged in a Beckman tabletop centrifuge at 2,000 X g for 10 minutes. 
The white blood cell pellet was resuspended in Cell Lysis Buffer. Once the pellet was 
completely resuspended and free of cell clumps, the solution was digested with KNase A for 
15 minutes at 37°C. Proteins were precipitated by addition of the provided Protein 
Precipitation Solution and removed by centrifugation. The DNA was precipitated out of the 
15 supernatant by addition of isopropanol. This method was simple and fast, requiring only 1-2 
hours, and allowed for the processing of dozens of samples simultaneously. The yield of 
DNA was routinely >8 mg for a 20 ml sample of whole blood and had a MW of >50 kb. 
DNA was archived by storing coded 50 fig aliquots at -80 °C as an ethanol precipitate. 

DNA was genotyped using one fluorescently labeled oligonucleotide primer and one 
20 unlabeled oligonucleotide primer. Labeled and unlabeled oligonucleotides were obtained 
from Integrated DNA Technologies, Inc. (Coralville, Iowa). All other reagents for 
microsatelhte genotyping were purchased from Perkin Elmer- Applied Biosystems, Inc. ("PE- 
ABF) (Norwalk, Connecticut). Individual PCR reactions were performed for each marker, as 
described by PE-ABI using AmpliTag DNA Polymerase. The reactions were added to 3.5 iA 
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of loading buffer containing deionized formamide, blue dextran and TAMRA 350 size 
standards (PE-ABI). After heating at 95°C for 5 minutes to denature the DNA, the samples 
were loaded and electrophoresed as described in the operator's manual for the Model 377 
DNA Sequencer (PE-ABI, Foster City, California). After gel electrophoresis, the data was 
5 analyzed using PE-ABI GENESCAN™ and GENOTYPER™ software. First, within the 
GENES CAN™ software/the lane tracking was manually optimized prior to the first step of 
analysis. After the gel lane data was extracted, the standard curve profiles of each lane were 
exammedandvermedforlmearityandsizecalling. Lanes, which had problems with either 
of these parameters, were re-tracked and verified. Once all lanes were tracked and the size 
10 standaxdswerecorxec^^ 

identification To expedite allele calling (binning), the program Linkage Designer from the 
Internet web-site of Dr. Guy Van Camp ^J^^^o.^6n^m^m was used. 
This program greatly facilitates the importing of data generated by GENOTYPER™ into me 
pedigree drawing program CyrilHc (Version2.0, Cherwell Scientmc Pubhshing Limited, 
15 Oxford, Great Britain) and subsequent linkage analysis using the program LINKAGE 
(Lathrop et al, Am. J. Hum. Genet, 37:482-498 (1985)). 

m. Linkage Analysis 

Fig. 1 demonstrates the pedigree of the individuals used in the genetic linkage studies 
f 0r this invention. Specifically, two-point linkage analysis was performed using the MLINK 
20 and LINKMAP components of the program LINKAGE (Lathrop et al, Am. J. Hum. Genet, 
37:482-498 (1985)). Pedigree/marker data was exported from Cyrillic as a pre-file into the 
Makeped program and converted into a suitable ped-file for linkage analysis. 
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The original linkage analysis was performed using three models: (i) an autosomal 
dominant, fully penetrant model, (ii) an autosomal dominant model with reduced penetrance, 
and (iii) a quantitative trait model. The HBM locus was mapped to chromosome 1 lql2-13 
by analyzing DNA for linked markers from 22 members of a large, extended kindred. A 
5 highly automated technology was used with a panel of 345 fluorescent markers which 

spanned the 22 autosomes at a spacing interval ranging from 6-22 cM. Only markers from 
this region of chromosome 11 showed evidence of linkage (LOD score -3.0). The highest 
LOD score (5.74) obtained by two-point and multipoint analysis was Dl 1S987 (map position 
55 in Fig. 2). The 95% confidence interval placed the HBM locus between markers Dl 1S905 
10 and Dl 1S937 (map position 41-71 in Fig. 2). Haplotype analysis also places the Zmaxl gene 
in this same region. Further descriptions of the markers Dl 1S987, Dl 1S905, and Dl 1S937 
can be found in Gyapay et ah, Nature Genetics, Vol. 7, (1994). 

In this invention, the inventors report the narrowing of the HBM interval to the region 
between markers Dl 1S987 and GTC_HBM_Maxker__5 . These two markers he between the 
15 delimiting markers from the original analysis (D11S11S905 andDl 1S937) and are 

approximately 3 cM from one another. The narrowing of the interval was accomplished 
using genotypic data from the markers D11S4191,D11S1883 S D11S1785, D11S4113, 
D11S4136, D11S4139, (Dib et al, Nature, 380:152-154 (1996)), FGF3 (Polymeropolous et 
al, Nucl Acid Res., 18:7468 (1990)) (information about the genetic markers can be found at 
20 the internet site of the Genome Database, http://gdbwww.gdb.org/), as well as the markers 
GTC_HBM JMarker_l , GTC_HBM_Marker_2, GTCJHBM_Marker_3, 
GTCJ£ffiM__Marker_4, GTC_HBM_Marker_5, GTC_HBM_Marker_6, and 
GTC_HBM_Marker_7. 
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As shown in Fig, 1, haplotype analysis with the ahove genetic markers identifies 
recombination events (crossovers) in individuals 9019 and 9020 that significantly refine the 
interval of chromosome 1 1 to which the Zmaxl gene is localized. Individual 9019 is an 
HBM-affected individual that inherits a portion of chromosome 1 1 from the maternal 

5 chromosome with the HBM gene, and a portion from the chromosome 1 1 homologue. The 
portion inherited from the HBM gene-carrying chromosome includes markers Dl 1S935, 
D11S1313, GTC_HBM_Marker_4, D11S987,D11S1296, GTC_HBM_Marker_6, 
GTC_HBM_Marker_2, D11S970, GTC_HBM_Marker_3, D11S4113, 
GTC_HBM_Marker_l , GTC_HBM_Marker_7 and GTC_HBM_Marker_5 . The portion 

10 from D11S4136 and continuing in the telomeric direction is derived from the non-HBM 
chromosome. This data places the Zmaxl gene in a location centromeric to the marker 
GTC_HBM_Marker_5. Individual 9020 is an unaffected individual who also exhibits a 
critical recombination event. This individual inherits a recombinant paternal chromosome 1 1 
that includes markers D11S935, D11S1313, GTC_HBM_Marker_4, D11S987, D11S1296 

15 and GTC_HBM_Marker_6 from her father's (individual 0115) chromosome 1 1 homologue 
that carries the HBM gene, and markers GTC_HBM_Marker_2, Dl 1S970, 
GTC_HBM_Marker_3, GTC_HBM_Marker_l , GTC_HBM_Marker_7, 
GTC_HBM_Marker_5,DllS4136, D11S4139, D11S1314, andDHS937 from her father's 
chromosome 11 that does not carry the HBM gene. Marker D11S41 13 is uninformative due 

20 to its homozygous nature in individual 0115. This recombination event places the 
centromeric boundary of the HBM region between markers Dl 1S1296 and Dl 1S987. 

Two-point linkage analysis was also used to confirm the location of the Zmaxl gene 
on chromosome 11. The linkage results for two point linkage analysis under a model of full 
penetrance are presented in Table 1 below. This table lists the genetic markers in the first 
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. • * a(W « the top of the table. Each cell of the column 
column and the recombination fractions across the top 01 

snows tie LOD score for a, ^ — «— *» »*» '° ^ «" * ^ 

occrasltm a*erDnS970, wU chiswi«Hnt i cin. K va ld e fin edb y haplo^ m alys 1 , 

TABLE 1 




A single nucleotide polymorphism (SNP) further defines the HBM region. This SNP 
2Q GTC HBM M^er.5. This SKP U telonreric to .e g en* rn^er OTC_HBM.^ 

• ♦ *«<INP The primer sequence for the genetic markers 
that the crossover is centxomenc to this SNP. The prime 

, , * attc TTBM Marker 7 is shown in Table 2 below. 
GTC_HBM_Marker_5 and GTC_tLBM_MarjLcr_/ 
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TABLE 2 



Marker 


Primer (Forward) 


Primer (Reverse) 


GTC HBM Marker 5 


TTTTGGGTACACAATTCAGTCG 


AAAACTGTGGGTGCTTCTGG 


GTC HBM Marker 7 


GTGATTGAGCCAATCCTGAGA 


TGAGCCAAATAAACCCCTTCT 



5 The kindred described have several features of great interest, the most important being 

that theirbones, while very dense, have an absolutely normal shape. The outer dimensions of 
the skeletons of the HBM-affected individuals are normal, and, while medullary cavities are 
present, there is no interference with hematopoiesis. The HBM-affected members seem to be 
resistant to fracture, and there are no neurologic symptoms, and no symptoms of impairment 
10 ofanyorganorsystemfunctionmmemembersexamined. HBM-affected members of the 
kindred live to advanced age without undue illness or disability. Furthermore, the HBM 
phenotype matches no other bone disorders such as osteoporosis, osteoporosis pseudoglioma, 
Engelmann's disease, Ribbing's disease, hyperphosphatasemia, VanBuchem's disease, 
melorheostosis, osteopetrosis, pycnodysostosis, sclerostenosis, osteopoikilosis, acromegaly, 
15 Pagef s disease, fibrous dysplasia, tubular stenosis, osteogenesis imperfecta,- 

hypoparathyroidism, pseudohypoparathyroidism, pseudopseudohypoparathyroidism, primary 
and secondary hyperparathyroidism and associated syndromes, hypercalciuria, meduUary. 
carcinoma of the thyroid gland, osteomalacia and other diseases. Clearly, the HBM locus in 
this family has a very powerful and substantial role in regulating bone density, and its 
20 identification is an important step in understanding the pathway(s) that regulate bone density 
and the pathogenesis of diseases such as osteoporosis. 

In addition, older individuals carrying the HBM gene, and therefore expression of the 
HBM protein, do not show loss of bone mass characteristic of normal individuals. Moreover, 
individuals carrying the HBM gene have lower triglycerides, VUDLs, and LDLs and/or 
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increased HDLs. In other words, the HBM gene is a suppressor of osteoporosis and may 
lessen cardiovascular risk arteriosclerotic and/or atherosclerotic associated conditions. In 
essence, individuals carrying the HBM gene are dosed with the HBM protein, and, as a result, 
lower levels of detrimental lipids (e.g., VLDL, LDL and triglycerides). This in vivo 
5 observation is strong evidence that treatment of normal individuals with the HBM gene or 
protein, or a fragment thereof, will ameliorate osteoporosis and arterio- or atherosclerotic 
conditions or diseases. 

IV. Physical Mapping 

To provide reagents for the cloning and characterization of the HBM locus, the 
10 genetic mapping data described above were used to construct a physical map of the region 
containing Zmaxl on chromosome llql3.3. The physical map consists of an ordered set of 
molecular landmarks, and a set of BAC clones that contain the Zmaxl gene region from 
chromosome llql3.3. 

Various publicly available mapping resources were utilized to identify existing STS 
15 markers (Olson et al., Science, 245:1434-1435 (1989)) in the HBM region. Resources 
included the GDB, the Whitehead Institute Genome Center,. dbSTS and dbEST (NCBI), 
1 Idb, the University of Texas Southwestern GESTEC, the Stanford Human Genome Center, 
and several ^literature references (Courseaux etaL, Genomics, 40:13-23 (1997), Courseaux et 
al, Genomics, 37:354-365 (1996), Guru et al, Genomics, 42:436-445 (1997), Hosoda et al, 
20 Genes Cells, 2:345-357 (1997), James et al, Nat. Genet, 8:70-76 (1994), Kitamura et al, 
DNA Research, 4:281-289 (1997), Lemmens etaL, Genomics, 44:94-100 (1997)/ Smith etal, 
Genome Res., 7:835-842 (1997)). Maps were integrated manually to identify markers 
mapping to the region containing Zmax 1 . 
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Primers for existing STSs were obtained from the GDB or literature references are 
listed in Table 3 below. Thus, Table 3 shows the STS markers used to prepare the physical, 
map of the Zmaxl gene region. 
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Novel STSs were developed either from publicly available genomic sequence or from 
sequence-derived BAC insert ends. Primers were chosen using a script which automatically 
performs vector and repetitive sequence masking using Crossmatch (P. Green, U. of 
Washington) and subsequent primer picking using Primer3 (Rozen, Skaletsky (1996, 1997). 
5 Primer3 is available at www.genome.wi.mit. edu/genome_software/omer/primer3.html. 

Polymerase chain reaction (PCR) conditions for each primer pair were initially 
optimized with respect to MgCl 2 concentration. The standard buffer was 10 mM Tris-HCl 
(pH 8.3), 50 mM KC1, MgCl 2 , 0.2 mM each dNTP, 0.2 fjM each primer, 2.7 ngfcl human 
DNA, 0.25 U of AmpliTaq (Perkin Elmer) and MgCl, concentrations of 1.0 mM, 1.5 mM, 
10 2.0 mM or 2.4 mM. Cycling conditions included an initial denaturation at 94°C for 2 
" minutes followed by 40 cycles at 94°C for 15 seconds, 55°C for 25 seconds, and 72°C for 25 
seconds followed by a final extension at 72°C for 3 minutes. Depending on the results from 
the initial round of optimization the conditions were further optimized if necessary. 
Variables included increasing the annealing temperature to 58°C or 60°C, increasing the 
15 cycle number to 42 and the annealing and extension times to 30 seconds, and using 
AmpliTaqGold (Perkin Elmer). 

BAC clones (Kim et al, Genomics, 32:213-218 (1996), Shizuya et aU Proc. Natl 
Acad. Sci. USA, 89:8794-8797 (1992)) containing STS markers of interest were obtained by 
PCR-based screening of DNA pools from a total human BAC library purchased from 
20 Research Genetics. DNA pools derived from library plates 1-596 were used corresponding to 
nine genomic equivalents of human DNA. The initial screening process involved PCR 
reactions of individual markers against superpools, i.e., a mixture of DNA derived from all 
BAC clones from eight 3 84-well library plates. For eacn positive superpool, plate (8), row 
(16) and column (24) pools were screened to identify a unique library address. PCR products 
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were eiectrophoresed m 2% agarose gels (Sigma) containing 0.5 >igtal cOndium bromide in 
IX TBE at 150 volts for 45 min. The electrophoresis units used were the Model A3-1 
systems from Owl Scientific Products. Typically, gels contained 10 tiers of lanes with 50 
wells/tier. Molecular weight markers (100 bp ladder, Life Technologies, Bethesda, MD) 
5 were loaded at both ends of the gel. Images of the gels were captured with a Kodak DC40 
CCDcameraandprocessedwithKodaklDsoftware: The gel data were exported as tab 
delimited text files; names of the files included information about the library screened, the gel 
image files and the marker screened. These data were automatically imported using a 
customized Perl script into Filemaker™ PRO (Claris Corp.) databases for data storage and 
10 analysis. In cases where incomplete or ambiguous clone address information was obtained, 
additional experiments were performed to recover a unique, complete library address. 

Recovery of clonal B AC cultures from the library involved streaking out a sample 
from the library well onto LB agar (Maniatis et aL, Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1982)) containing 12.5 
15 ug/ml chloramphenicol (Sigma). Two individual colonies and a portion of the initial streak 
* quadrant were tested with appropriate STS markers by colony PCR for verification Positive 
clones were stored in LB broth containing 12.5 ug/ml chloramphenicol and 15% glycerol at - 
70°C. 

Several different types of DNA preparation methods were used for isolation of B AC 
20 DNA. The manual alkaline lysis miniprep protocol listed below (Maniatis et aL, Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 
(1982)) was successfully used for most applications, i.e., restriction mapping, CHEF gel 
analysis, FISH mapping, but was not successfully reproducible in endsequencing. The 
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Autogen and Qiagen protocols were used specihcally tor J3AU DNA preparation tor 
endsequencktg purposes. 

Bacteria were grown in 15 ml Terrific Broth containing 12.5 ug/ml chloramphenicol 
in a 50 ml conical tube at 37°C for 20 hrs with shaking at 300 rpm. The cultures were 
5 centrifuged in a Sorvall RT 6000 D at 3000 rpm (~1 800 g) at 4°C for 15 mho. The 

supernatant was then aspirated as completely as possible. In some cases ceU pellets were 
frozen at -20°C at this step for up to 2 weeks. The pellet was then vortexed to homogenize 
the cells and mmimize clumping. 250 ul of PI solution (50 mM glucose, 15 mM Tris-HCL 
pH 8, 10 mM EDTA, and 100 ug/ml RNase A) was added and the mixture pipetted up and 
10. down to mix. The mixture was then transferred to a 2 ml Eppendorf tube. 350ulofP2 

solution (0.2 N NaOH, 1% SDS) was then added, the mixture mixed gently and incubated for 
5 min. at room temperature. 350 ul of P3 solution (3M KOAc, pH 5.5) was added and the 
mixture mixed gently until a white precipitate formed. The solution was incubated on ice for 
5 min. and then centrifuged at 4°C in a microfuge for 10 min. The supernatant was 
15 transferred carefully (avoiding the white precipitate) to a fresh 2 ml Eppendorf tube, and 0.9 
ml of isopropanol was.added, the solution mixed and left on ice for 5 rnin. The samples were 
centrifuged for 1 0 min., and the supernatant removed carefully. Pellets were washed in 70% 
ethanol and air dried for 5 min. Pellets were resuspended in 200 ul of TE8 (10 mM Tris-HCl, 
pH 8.0, 1.0 mM EDTA), and RNase A (Boehringer Mannheim) added to 1 00 ug/ml. 
20 Samples were incubated at 37°C for 30 min., then precipitated by addition of 

C2H 3 0 2 Na-3H 2 0 to 0.5 M and 2 volumes of ethanol. Samples were centrifuged for 10 min., 
and the pellets washed with 70% ethanol followed by air drying and dissolving in 50 fd TE8. 
Typical yields for this DNA prep were 3-5 ug/15 ml bacterial culture. Ten to 15 ul were used 
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for HindHI restriction analysis; 5 jil was used for NotI digestion and clone insert sizing by 
CHEF gel electrophoresis. 

BACs were inoculated into 15 ml of 2X LB Broth containing 12.5 fig/ml 
chloramphenicol in a 50 ml conical tube. 4 tubes were inoculated for each clone. Cultures 
5 were grown overnight (-16 hr) at 37°C with vigorous shaking (>300 rpm). Standard 
conditions for BAC DNA isolation were followed as recommended by the Autogen 740 
manufacturer. 3 ml samples of culture were placed into Autogen tubes for a total of 60 ml or 
20 tubes per clone. Samples were dissolved finally in 100 jil TE8 with 15 seconds of shaking 
as part of the Autogen protocol. After the Autogen protocol was finished DNA solutions 

10 were transferred firom each individual tube and pooled into a 2 ml Eppendorf tube. Tubes 
with large amounts of debris (carry over from the pelleting debris step) were avoided. The 
tubes were then rinsed with 0.5 ml of TE8 successively and this solution added to the pooled 
material. DNA solutions were stored at 4°C; clumping tended to occur upon freezing at - 
20 °C. This DNA was either used directly for restriction mapping, CHEF gel .analysis or 

15 FISH mapping or was further purified as described below for use in endsequencing reactions. 
The volume of DNA solutions was adjusted to 2 ml with TE8, samples were then 
mixed gently and heated at 65 °C for 10 min. The DNA solutions were then centriftiged at 
4° C for 5 min, and the supernatants transferred to a 15 ml conical tube. The NaCl 
concentration was then adjusted to 0.75 M (-0.3 ml of 5 M NaCl to the 2 ml sample). The 

20 " total volume was then adjusted to 6 ml with Qiagen column equilibration buffer (Buffer 

QBT). The supernatant containing the DNA was then applied to the column and allowed to 
enter by gravity flow. Columns were washed twice with 10 ml of Qiagen Buffer QC. Bound 
DNA was then eluted with four separate 1 ml aliquots of Buffer QF kept at 65 °C. DNA was 
precipitated with 0.7 volumes of isopropanol (-2.8 ml). Each sample was then transferred to 
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4 individual 2.2 ml Eppendorf tubes and incubated at room temperature for 2 hr or overnight. 
Samples were centrifuged in a microfuge for 10 min. at 4°C. The supernatant was removed 
carefully and 1 ml of 70% ethanol was added. Samples were centrifuged again and because 
the DNA pellets were often loose at this stage, the supernatant removed carefully. Samples 
5 were centrifuged again to concentrate reinaining liquid which was removed with a micropipet 
tip. DNA pellets were then dried in a desiccator for 10 min. 20 & of sterile distilled and 
deionized H 2 0 was added to each tube which was then placed at 4°C overnight The four 20 
/A samples for each clone were pooled and the tubes rinsed with another 20 iA of sterile 
distilled and deionized H 2 0 for a final volume of 100 yL Samples were then heated at 65 °C 
10 for 5 rnin. and then mixed gently. Typical yields were 2-5 ^60 ml culture as assessed by 
NotI digestion and comparison with uncut lambda DNA. 

3 ml of LB Broth containing 12.5 /ig/ml of chloramphenicol was dispensed into 
autoclavedAutogen tubes. A single tube was used for each clone. For inoculation, glycerol 
stocks were removed from -70°C storage and placed on dry ice. A small portion of the 
15 glycerol stock was removed from the original tube with a sterile toothpick and transferred 
into the Autogen tube; the toothpick was left in the Autogen tube for at least two minutes 
before discarding. After inoculation the tubes were covered.with tape making sure the seal . 
was tight. When all samples were inoculated, the tube units were transferred into an Autogen 
rack holder and placed into a rotary shaker at 37 °C for 16-17 hours at 250 rpm. Following 
20 growth, standard conditions for BAC DNA preparation, as defined by the manufacturer, were 
used to program the Autogen. Samples were not dissolved in TE8 as part of the program and 
DNA pellets were left dry. When the program was complete, the tubes were removed from 
the output tray and 30 ul of sterile distilled and deionized H 2 0 was added directly to the 
bottom of the tube. The tubes were then gently shaken for 2-5 seconds and then covered with 
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parafilm and incubated at room temperature for 1-3 hours. DNA samples were then 
transferred to an Eppendorf tube and used either directly for sequencing or stored at 4°C for 
later use. 

V. BAC Clone Characterization for Physical Mapping 
5 DNA samples prepared either by manual alkaline lysis or the Autogen protocol were 

digested with HindHT for analysis of restriction fragment sizes. This data were used to 
compare the extent of overlap among clones. Typically 1-2 jig were used for each reaction. 
Reaction mixtures included: IX Buffer 2 (New England Biolabs), 0.1 mg/ml bovine serum 
albumin (New England Biolabs), 50 fig/ml RNase A (Boehringer Mannheim), and 20 units of 

10 Hindin (New England Biolabs) in a final volume of 25 jd. Digestions were incubated at 
37 °C for 4-6 hours. BAC DNA was also digested with NotI for estimation of insert size by 
CHEF gel analysis (see below). Reaction conditions were identical to those for Hindm 
except that 20 units of NotI were used. Six /zl of 6X Ficoll loading buffer containing 
bromphenol blue and xylene cyanol was added prior to electrophoresis. 

15 HmdlH digests were analyzed on 0.6% agarose (Seakem, FMC Byproducts) in IX 

TBE containing 0.5 ^ig/ml ethidium bromide. Gels (20 cmX 25 cm) were electrophoresed in 
a Model A4 electrophoresis unit (Owl Scientific) at 50 volts for 20-24 hrs. Molecular weight 
size markers included undigested lambda DNA, Hindin digested lambda DNA, and HaeHI 
digested _X174 DNA. Molecular weight markers were heated at 65 °C for 2 min. prior to 
20 loading the gel. Images were captured with a Kodak DC40 CCD camera and analyzed with . 
Kodak ID software. 

NotI digests were analyzed on a CHEF DRII (BioRad) electrophoresis unit according * 
to the manufacturer's recommendations. Briefly, 1% agarose gels (BioRad pulsed field 
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grade) were prepared in 0.5X TBE, equilibrated for 30 minutes in the electrophoresis unit at 
14°C, and electrophoresed at 6 volts/cm for 14 hrs with circulation. Switching times were 
ramped from 10 sec to 20 sec. Gels were stained after electrophoresis in 0.5 ug/ml etmdium 
bromide. Molecular weight markers included undigested lambda DNA, HmdHI digested 
5 lambda DNA, lambda ladder PFG ladder, and low range PFG marker (all from New England 
Biolabs). 

B AC DNA prepared either by the manual alka l i n e lysis or Autogen protocols were 
labeled for FISH analysis using a Bioprime labeling kit (BioRad) according to the 
manufacturer's recommendation with minor modifications. Approximately 200 ng of DNA 
10 was used for each 50 ul reaction. 3 ul were analyzed on a 2% agarose gel to determine the 
extent of labeling. Reactions were purified using a Sephadex G50 spin column prior to in 
situ hybridization. Metaphase FISH was performed as described (Ma et a!., Cytogenet. Cell 
Genet., 74:266-271 (1996)). 

VI. BAC Endsequencing 

15 The sequencing of BAC insert ends utilized DNA prepared by either of the two 

methods described above. The DYEnamic energy transferprimers and Dynamic Direct cycle 
sequencing kits from Amersham were used for sequencing reactions. Ready made 
sequencing mix including the,M13 -40 forward sequencing primer was used (Catalog # 
US79730) for the T7 BAC vector terminus; ready made sequencing mix (Catalog # 

20 US79530) was mixed with the M13 -28 reverse sequencing primer (Catalog # US79339) for 
the SP6 BAC vector terminus. The sequencing reaction mixes included one of the four 
fluorescently labeled dye-primers, one of the four dideoxy terrninatipn mixes, dNTPs, 
reaction buffer, and Thermosequenase. For each BAC DNA sample, 3 ul of the BAC DNA 
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sample was aliquoted to 4 PCR strip tubes. 2 ul of one of the four dye primer/tennination 
mix combinations was then added to each of the four tabes. The tubes were then sealed and 
centrifuged briefly prior to PCR. Thermocycling conditions involved a 1 minute denaturation 
at 95°C, 15 second annealing at 45°C, and extension for 1 minute at 70°C for 35 total cycles. 
5 After cycling the plates were centrifuged briefly to collect all the Uquid to the bottom of the 
tubes. 5 /A of sterile distilled and deionizedH 2 0 was then added into each tube, the plates 
sealed and centrifuged briefly again. The four samples for each B AC we then pooled 
together. DNA was then precipitated by adding 1.5 [A of 7.5 M NH4OAC and 100 {A of- • 
20°C 100% ethanol to each tube. Samples were mixed by pipetting up and down once. The 
10 plates were then sealed and incubated on ice for 10 minutes. Plates were centrifuged in a 
table top Haraeus centrifuge at 4000 rpm (3,290 xg) for 30 minutes at 4°C to recover the 
DNA. Thesupenmtantwasremovedandexcessliquidblottedontopapertowels. Pellets 
were washed by adding 100 tA of -20° C 70% ethanol into each tube and recentrifuging at 
4000 rpm (3,290 xg) for 10 minutes at 4°C. The supernatant was removed and excess liquid 
15 again removed by blotting on a paper towel. Remaining traces of liquid were removed by 
placing the plates upside down over a paper towel and centrifuging only until the centrifuge 
reached 800 rpm. Samples were then air dried at room temperature for 30 min. Tubes were 
capped and stored dry at -20°C until electrophoresis. Immediately prior to electrophoresis 
the DNA was dissolved in 1 .5 fA of Amersham loading dye. Plates were then sealed and 
20 centrifuged at 2000 rpm (825 xg). The plates were then vortexed on a plate shaker for 1-2 . 
minutes. Samples were then recentrifuged at 2000 rpm (825 xg) briefly. Samples were then 
heated at 65 0 C for 2 min. and immediately placed on ice. Standard gel electrophoresis was 
performed on ABI 377 fluorescent sequencers according to the manufacturer's 
recommendation. 
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VIL Sub-cloning and Sequencing of HBM B AC 3>NA 

The physical map of the Zmaxl gene region provides a set of BAC clones that contain 
within them the Zmcvcl gene and the HBM gene. DNA sequencing of several of the BACs 
from the region has been completed. The DNA sequence data is a unique reagent that 
5 includesdatathatoneslcmedintheartcanuseto identify the Zmoxi gene and the HBM gene, 
or to prepare probes to identify the gene(s), or to identify DNA sequence polymorphisms that 

identify the geue(s). 

BAC DNA was isolated according to one of two protocols, either a Qiagen 
purification of BAC DNA (Qiagen, Inc. as described in the product literature) or a manual 
10 purification which is a modification of the standard alkaline lysis/Cesium Chloride 
preparation of plasmid DNA (see e.g,, Ausubel * aU Current Protocols in Molecular 
Biology, John Wiley & Sons (1997)). Briefly for the manual protocol, cells were pelleted, 
resuspended in GTE (50 mM glucose, 25 mM Tris-Cl (pH 8), 10 mM EDTA) and lysozyme 
(SOmg/m! sohrtion), followed by NaOH/SDS (1% SDS/0.2N NaOH) and then an ice-cold 
15 solution of3MKOAc(pH 4.5-4.8). EnaseA was added to the filtered supernatant, Mowed 
b yProteina S eKand20%SDS. The DNA was then precipitated with isopropanol, dried "and 
resuspended in TE (10 mM Tris, 1 mM EDTA (pH 8.0)). The BAC DNA was further 
purified by Cesium Chloride density gradient centrifugation (Ausubel et aL, Current 
Protocols in Molecular Biology, John Wiley & Sons (1997)). 
20 ' Fouowtog isolation, meBAC DNA was sheared hy^ 

(Hengen, IM in Biochem. ScL, 22:273-274 (W97)) to aninsert size of 2000-3000 bp. 
Aftershearing.meDNAwas.concentratedandseparatedonastandardiyoagarosegel. A 

. single fraction, corresponding to the approximate size, was excised from the gel and purified 
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by electroelution (Sambrook et aL, Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory, Cold Spring, NY (1989)). 

The purified DNA fragments were then blunt-ended using T4 DNA polymerase. The 
blunt-ended DNA was then ligated to unique BstXI-linker adapters (5 f - 

5 GTCTTCACCACGGGG and 5' GTGGTGAAGAC in 100-1000 fold molar excess): These 
linkers were complimentary to the BstXI-cut pMPX vectors (constructed by the inventors), 
while the overhang was not self-complimentary. Therefore, the linkers would not 
concatemerize nor would the cut- vector religate itself easily. The linker-adapted inserts were 
separated from the unincorporated linkers on a 1% agarose gel and purified using GeneClean 

10 (BIO 101, Inc.). The linker-adapted insert was then ligated to a modified pBlueScript vector 
to construct a "shotgun" subclone library. The vector contained an out-of-frame lacZ gene at 
the cloning site which became in-frame in the event that an adapter-dimer is cloned, allowing 
these to be avoided by their blue-color. 

All subsequent steps were based on sequencing by ABI377 automated DNA 

15 sequencing methods. Only major modifications to the protocols are highlighted. Briefly, the 
library was then transformed into DH5a competent cells (Life Technologies, Bethesda, MD, 
DH5a transformation protocol). It was assessed by plating onto antibiotic plates containing 
ampicillin and IPTG/XgaL The plates were incubated overnight at 37 °C. Successful 
transform ants were then used for plating of clones and picking for sequencing. The cultures 

20 were grown overnight at 37 °C. DNA was purified using a silica bead DNA preparation (Ng 
et aL, Nucl Acids Res., 24:5045-5047 (1996)) method. In this manner, 25 of DNA was 
obtained per clone. 

These purified DNA samples were then sequenced using ABI dye-terminator 
chemistry. The ABI dye terminator sequence reads were run on ABI377 machines and the 
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data was directly «M t. UNTX machines following lane hacking of 4. gels. All reads 
were assembled using PHRAP (P. Green, Abstracts of DOE Human GenomeProgram 
Contractor-Grantee Workshop V, Jan. 1996, p.157) with default parameters and quality 
scores. The initial assembly was done a. 6-fold coverage and yielded an average of 8-15 
5 contigs. na^toW-^.'^^**™^^^'^ 

identification of additional overlapping contigs. Primers for walking were selected using a 
GenomeTherapeuticsprogramPick^ernearthe ends of the clones to facilitate gap 
closure. These walks were sequenced using tie selected clones and primers. Datawere 
10 reassembled withPHRAP into sequence contigs. 

Vm. Gene Identification by Computational Methods 

Following assembly of the BAG sequences into contigs, the contigs were subjected to 
computational analyses to identify coding regions and regions bearing DNA sequence 
similarity to known genes. This protocol included the following steps. 
u • 1. Degap the contigs: the sequence contigs often contain symbols (denotedby a 

periodsymbol) mat represent locations where the Mdividual.ABI sequence reads have 
fcaertions or deletions. Prior to automated computational analysis of the contigs, the penods 
wereremoved. The original data .was maintained for future reference. 

2 BAC vector sequences were "masked" within the sequence by using the 

shotoun Ubraries construction detailed above leaves some BAC vector in the shotgun 
libraries, this program was used to compare the sequence of the BAC contigs to the BAC 
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vectorandtomaskanyvectorseqnencepriortosubseduentsteps. Masked sequences were 
marked by an "X" in the sequence files, and remained inert during subsequent analyses. 
3 . R coli sequences contaminating the B AC sequences were masked by 

comparing the BAC contigs to the entire E. coli DNA sequence. 
5 4. ' Repetitiveelementsknowntobecornmonmthehumangenomeweremasked 

using cross match. In this implementation of crossmatch, the BAC sequence was compared 
to a database of human repetitive elements (Jerzy Jerka, Genetic Information Research 
Institute, Palo Alto, CA). The masked repeats were marked by X and remained inert during ■ 

subsequent analyses. ■ " 

10 5. Thelocalionofcxo^withMthesequenccwasptedicted^gtlieMZEF 

computer program (Zhang, Proc. Natl. Acad. Set, 94:565-568 (1997)). 

6. He sequence was compared to the publicly available unigene database 
(National Center for Biotechnology Information, National library of Medicine, 38A, 8N905, 
8600 Rockville Pike, Bethesda,MD 20894; www^cbi^nih.gov) using the bUstn2 
U 'algon to (Ai e chul« a l,^.^i!-.25:3389-3402(1997)). Tne parameters for this 
search were: E-0.05, v=50, B=50 (where E is the expected probability score cutoff, V is the 
number of database enWesre^edmrnerecortingof the re^ts, and B is tbe number of 
sequence alignments returned in the reporting of me results (Altschu! « al, J. Mol. Biol., 
215:403-410(1990)). 

20 7. Tta sequence was translated into protein for all six reading frames, and the 

protein sequences were compared to a non-redundant protein da^base cornpUed from 
GenpeptSwissprotPIR^ational Center for Biotechnology Information, National Library of 
Medicine, 38A, 8N905, 8600 EocMle P*e, Bemesda, MD 20894; 
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„ , n r\* \r-<n "R= SO where E. V, and B are defined as 

The parameters for this search were E=0.05, V-50, B . :>u, wnere ^ ; v } ^ 

above. 

8. The BAC DNA sequence was compared to the database of the cDNA clones 
derived from direct selection experiments (described below) using blastn2 (Altschul et al, 

5 Nucl Acids. Res., 25:3389-3402 (1997)). The parameters for this search were E=0.05, 
V=250, B=250, where E, V, and B are defined as above. 

9. The BAC sequence was compared to the sequences of all other BACs from the 
HBM region on chromosome llql2-13 using blastn2 (Altschul et al, Nucl. Acids. Res., 
25:3389-3402 (1997)). The parameters for this search were E=0.05, V=50, B=50, where E, V, 

10 and B are defined as above. 

10. The BAC sequence was compared to the sequences derived from the ends of 
BACs from the HBM region on chromosome llql2-13 using blastn2 (Altschul et al, Nucl. 
Acids. ^.,-25:3389-3402 (1997)). The parameters for this search were E=0.05, V=50,B=50, 

where E, V, and B are defined as above. 

■ n. The BAC sequence was compared to the Genbank database (National Center 

for Biotechnology Information National ^ 

Pike, Bethesda, MD 20894; wwwmcbi.nlmnih.gov) using Hastn2 (Altschul et al, Nucl. 
Acids. Res., 25:3389-3402 (1997)). The parameters for this search were E=0.05, V=50, B=50, 

where E, V, and B are defined as above. 
20 12. TheBACs.quencewascomparedtofh^STSdivisionofGeribaiitot.ba^ 

(National Center for Biotechnology Information, National Library of Medicine, 38A, 8N905, 
SSOORocfeviaeKke.Bethesda.MD 20894; www *cbi*lm.rrfh.gov) using blasts (Altschul 
etal, 1997). The parameters for this search were E^O-O?, V=50, B= 50, where E, V, and B 
are defined as above. 
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13. The BAC sequence was compared to the Expressed Sequence (EST) Tag 
Genbank database (National Center for Biotechnology Information, National Library of 
Medicine, 38A, 8N905, 8600 Rockville Pike, Bethesda, MD 20894; www.ncbi.nlm.nih.gov) 
using blasts (AltschuW^^ The parameters for 

5 this search were E=0.05, V=250, B=250, where E, V, and B are denned as above. 

EC Gene Identification by Direct cDNA Selection 

Primary linkered cDNA pools were prepared from bone marrow, calvarial bone, 

femoralbone, kidney, sM 

from calvarial and femoralbone tissue (Chomczynski et al, Anal Biochem., 162:156-159 
10 (1987); D'Alessio et al, Focus-, 9:1-4 (1987)) and the remainder of the mBNA was 

purchased from Clontech (Palo Alto, California). In order to generate oH g o(dT) and random 
' pnmedcDNApoolsfromthesametissue, 2.5 ug mRNA was mixed with oligo(dT) primer 
in one reaction and 2.5 ugmRNA was nnxed wim random hexamers in another reaction, and 
both were converted to first and second strand cDNA according to manufacturers 

15 recommendations^^ 

(see sequence below)were sealed together by mixmgm, 1:1 -ho (10 ^ g each) incubated 

at 65 °C for five minutes and allowed to cool to room temperature. 
Paired linkers oligol/2 

OLIGO 1: 5'CTGAGC GGA ATT CGT GAG ACC3' (SEQIDNO:12) 
OLIGO 2: 5'TTG GTC TCA CGT ATT CCG CTC GA3' (SEQ ID NO:13) 



20 



Paired linkers oligo3/4 

OLIGO 3: 5'CTC GAG AAT TCT GGA TCC TC3' (SEQ ID NO:14) 
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OLIGO 4: 5TTG AGG ATC CAG AAT TCT CGA G3' (SEQ ID M):15) 

Paired linkers oligo5/6 

OLIGO 5: 5TGT ATG CGA ATT CGC TGC GCG3' (SEQ ID NO:16) 
OLIGO 6: 5TTC GCG CAG CGA ATT CGC ATA CA3' (SEQ ID NO:17) 

5 Paired linkers oligo7/8 

OLIGO 7: 5'GTC CAC TGA ATT CTC AGT GAG3' (SEQ ID NO:18) 
OLIGO 8: 5TTG TCA CTG AGA ATT CAG TGG AC3' (SEQ ID NO:19) 

Paired linkers oligol 1/12 

OLIGO 11: 5'GAA TCC GAA TTC CTG GTC AGC3* (SEQ ID NO:20) 
10 OLIGO 12: 5TTG CTG ACC AGG AAT TCG GAT TC3* (SEQ ID NO:21) 

Linkers were Ugated to all oligo(dT) and random primed cDNA pools (see below) according 
to manufacturers instructions (Life Technologies, Bethesda, MD). 

OUgo 1/2 was Ugated to oligo(dT) and random primed cDNA pools prepared from 
bone marrow. OUgo 3/4 was Ugated to oUgo(dT) and random primed cDNA pools prepared 
15 from calvarial bone. OUgo 5/6 was Ugated to oligo(dT) and random primed cDNA pools 
preparedfrom brain and skeletal muscle. Oligo 7/8 was Ugated to oUgo(dT) and random 
primed cDNA pools prepared from kidney. OUgo 1 1/12 was ligated to oUgo(dT) and random 

primed cDNA pools prepared from femoral bone. 

The cDNApools were evaluated for length distribution by PCR amplification using 1 
20 I Uofal:l,l:10,andl:100dilutionoftheHgationreaction,re S pectively. PCRreactions 
were performed in a Perkin Elmer 9600, each 25 ul volume reaction contained 1 ul ofDKA, 
10 mM Tris-HCl (pH 8.3), 50 mMKCL 1.5 mM MgC12, 0.001% gelatin, 200 mM each 
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dNTPs, 10 piM primer and 1 unit Taq DNA polymerase (Perkin Elmer) and was amplified 
under the following conditions: 30 seconds at 94° C, 30 seconds at 60° C and 2 minutes at 
72 °C for 30 cycles. The length distribution of the amplified cDNA pools were evaluated by 
electrophoresis on a 1% agarose gel. The PGR reaction that gave the best representation of 
5 the random primed and oligo(dT) primed cDNA pools was scaled up so that -2-3 \ig of each 
cDNA pool was produced. The starting cDNA for the direct selection reaction comprised of 
0.5 jig of random primed cDNAs mixed with 0.5 }ig of oligo(dT) primed cDNAs. 

The DNA from the 54 B ACs that were used in the direct cDNA selection procedure 
was isolated using Nucleobond AX columns as described by the manufacturer (The Nest 
10 Group, Inc.). 

The BACs were pooled in equimolar amounts and 1 ^ig of the isolated genomic DNA 
was labelled with biotin 16-UTP by nick translation in accordance with the manufacturers 
instructions (Boehringer Mannheim). The incorporation of the biotin was monitored by 
methods that could be practiced by one skilled in the art (Del Mastro and Lovett, Methods in 
15 Molecular Biology, Humana Press Inc., NJ (1996)). 

Direct cDNA selection was performed using methods that could be practiced by one 
skilled in the art (Del Mastro and Lovett, Methods in Molecular Biology, Humana Press Inc., 
NJ (1996)). Briefly, the cDNA pools were multiplexed in two separate reactions: In one 
reaction cDNA pools from bone marrow, calvarial bone, brain and testis were mixed, and in 
20 the other cDNA pools from skeletal muscle, kidney and femoral bone were mixed. 

Suppression of the repeats, yeast sequences and plasmid in the cDNA pools was performed to 
a Cot of 20. 100 ng of biotinylated BAC DNA was mixed with the suppressed cDNAs and 
hybridized in solution to a Cot of 200. The biotinylated DNA and the cognate cDNAs was 
captured on streptavidin-coated paramagnetic beads. The beads were washed and the primary 
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selected cDNAs were doted. These cDNAs were PCR amplified and a second round of 
direct selection was performed. The product of the second.round of direct selection is 
referred to as the secondary selected material. A Galanin cDNA clone, previously shown to 
map to llql2-13 (Evans, Genomics, 18:473-477 (1993)), was used to monitor enrichment 

5 during the two rounds of selection. 

. The secondary selected material from bone marrow, calvarial bone, femoral bone, 
kidney, skeletal muscle, testis and total brain was PCR amplified using modified primers of 
oligos 1. 3, 5, 7 and 1 1, shown below, and cloned into the UDG vector pAMPIO (Life 
Technologies, Bethesda, MD), in accordance withthe manufacturer's recommendations. 

10 Modified primer sequences: 

OUgol-CUA: 5'CUA CUA CUA CUA CTG AGO GGAATT CGT GAG ACC3' (SEQ ID 
NO:22) 

OUgo3-CUA: 5'CUA CUA CUA CUA CTC GAG AAT TCT GGA TCC TC3' (SEQ ID 
KO:23) 

IS OUgoS-CUA: 5'CUA CUA CUA CUA TGT ATG CGA ATT CGC TGC GCG3' (SEQ ID 
1*0:24) 

01igo7-CUA: 5'CUA CUA CUA CUA GTC CAC TGA ATT CTC AGT GAG3' (SEQ ID 
NO:25) 

Oligoll-CUA: 5'CUA CUA CUA CUA GAATCC GAa'tTC CTG GTC AGC3' (SEQ ID 
20 NO:26) 

The cloned secondary selected material, from each tissue source, was transformed into 
MAX Efficiency DH5a Competent Cells (Life Technologies, Bethesda, MD) as 
" recommended by the manufacturer. 384 colonies were picked from each transformed source 
and arrayed into four 96 well microtiter plates. 
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All secondarily selected cDNA clones were sequenced using Ml 3 dye primer 
terminator cycle sequencing kit (Applied Biosystems), and the data collected by the ABI 377 
automated fluorescence sequencer (Applied Biosystems). 

All sequences were analyzed using the BLASTN, BLASTX and FASTA programs 
(Altschul et al, J. Mol. Biol, 215:403-410 (1990), Altschul et al., Nucl. Acids. Res., 25:3389- 
3402 (1997)). The cDNA sequences were compared to a database containing sequences 
derived from human repeats, mitochondrial DNA, ribosomal RNA, & coli DNA to remove 
background clones from the dataset using the program cross_match. A further round of 
comparison was also performed using the program BLASTN2 against known genes 
(Genbank) and the B AC sequences from the HBM region. Those cDNAs that were >90% 
homologous to these sequences were filed according to the result and the data stored in a 
database for further analysis. cDNA sequences that were identified but did not have 
significant similarity to the BAC sequences from the HBM region or were eliminated by 
cross_match were hybridized to nylon membranes which contained the BACs from the HBM 
region, to ascertain whether they hybridized to the target. 

Hybridization analysis was used to map the cDNA clones to the BAC target that 
selected them. The BACs that were identified from the HBM region were arrayed and grown 
into a 96 well microtiter plate. LB agar containing 25 ug/ml kanamycin was poured into 96 
well microtiter plate lids. Once the agar had solidified, pre-cut Hybond N+ nylon membranes 
(Amersham) were laid on top of the agar and the BACs were stamped onto the membranes in 
duplicate using a hand held 96 well replica plater (V&P Scientific, Inc.). The plates were 
incubated overnight at 37 °C. The membranes were processed according to the manufacturers 
recommendations. 
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The cDNAs that needed to be mapped by hybridization were PCR amplified using the 
relevant primer (oligos 1 , 3, 5, 7 and 1 1) that would amplify that clone. For this PCR 
amplification, the primers were modified to contain a lrnkered digoxigenin molecule at the 5' 
of the oligonucleotide. The PCR amplification was performed under the same conditions as 
5 describedmPreparationofcDNAPoolsCabove). The PCR products were evaluated for 
quality and quantity by electrophoresis on a 1% agarose gel by loading 5 ul of the PCR 
reaction. The nylon membranes containing the stamped BACs were individually pre- 
hybridized in 50 ml conical tubes containing 10 ml of hybridization solution (5x SSPE, 0.5x 
Blotto, 2.5% SDS and 1 mMEDTA (pH 8.0)). The 50 ml conical tubes were placed in a 
10 rotisserie oven (Robbins Scientific) for 2 hours at 65°C. 25 ng of each cDNA probe was 

denatured and added into individual 50 ml conical tubes containing the nylon membrane and 
hybridization solution. The hybridization was performed overnight at 65 °C The filters were 
washed for 20 minutes at 65°C in each of the following solutions: 3x SSPE, 0.1% SDS; lx 
SSPE, 0.1% SDS and O.lx SSPE, 0.1% SDS. 
15 The membranes were removed from the 50 ml conical tubes and placed in a dish. 

Acetate sheets were placed between each membrane to prevent them from sticking to each 
other. The incubation of the membranes with the Anti-DIG-AP and CDP-Star was performed 
according to manufacturers recommendations (Boehringer Mannheim). The membranes were 
wrapped in Saran wrap and exposed to Kodak Bio-Max X-ray film for 1 hour. 

20 X. cDNA Cloning and Expression Analysis 

To characterize the expression of me genes identified by direct cDNA selection and 
genomic DNA sequencing in comparison to the publicly available databases, a series of . 
experiments were performed to further characterize the genes in the HBM region. First, • 
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the small pieces were frozen in liquid nitrogen and reduced to a powder in a tissue pulverizer 
(Spectrum Laboratory Products). RNA was extracted from the powdered bone by 
homogenizing the powdered bone with a standard Acid Guamdinium 
Thiocyanate-Phenol-Chlorofonn extraction buffer (e.g. Chomczynski and Sacchi, Anal 
5 Biochem., 162:156-159 (1987)) using a polytron homogenizer (Brinkman Instruments). 
Additionally, human brain and lung total RNA was purchased from Clontech. PolyA RNA 
was isolated from total RNA using dynabeads-dT according to the manufacturer's 

recommendations (Dynal, Inc.)- 

First strand cDNA synthesis was initiated using an oligonucleotide primer with the 

10 sequence: S'-AACTGGAAGAATTC^^ 

(SEQ ID NO:27). This primer introduces a NotI restriction site (underlined) at the 3' end of 
the cDNA. First and second strand synthesis were performed using the "one-tube" cDNA 
synthesis kit as described by the anufactuxer (Life Technologies, Bethesda, MD). Double 
stranded cDNAs were treated with T4 polynucleotide kinase to ensure that the ends of the 
15 molecules were blunt (Sozks, in Automated DNA Sequencing and Analysis, Adams, Fields 
and Venter, Eds., Academic Press, NY, pages 110-1 14 (1994)), and the blunt ended cDNAs 
were then size selected by a Biogel column (Huynh et al. m-DNA Cloning, Vol. 1, Glover, 
Ed., IPX Press, Oxford, pages 49-78 (1985)) or with a size-sep 400 sepharose column 
(Pharmacia, catalog # 27-5105-01). Only cDNAs of 400 base pairs or longer were used in 
20 subsequent steps. EcoRI adapters (sequence: 5' OH-AATTCGGCACGAG-OH 3' (SEQ ID 
NO:28), and 5' p-CTCGTGCCG-OH 3' (SEQ ID NO:29)) were then ligated to the double 
stranded cDNAs by methods familiar to one skilled in the art (Scares, 1994). The EcoRI 
adapters were then removed from the 3' end of the cDNAby digestion withNotl (Scares, . 
1 994). The cDNA was then ligated into the plasmid vector pBluescript n KS + (Stratagene, 
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La Jolla, California), and the ligated material was transformed into E. coli host DH1 OB or 
DH12S by electroporation methods familiar to one skilled in the art (Soares, 1994). After 
growth overnight at 37 °C, DNA was recovered from the E. coli colonies after scraping the 
plates by processing as directed for the Mega-prep kit (Qiagen, Chatsworfh, California). The . 
5 quality of thecDNA libraries was estimated by counting a portion of the total numbers of 
primary transformants and determining the average insert size and the percentage of plasmids 
with no cDNA insert Additional cDNA libraries (human total brain, heart, kidney, 
leukocyte, and fetal brain) were purchased from Life Technologies, Bethesda, MD. 

cDNA libraries, both oligo (dT) and random hexamer (N<D primed, were used for 
10 isolating cDNA clones transcribed within the HBM region: human bone, human brain, human 
kidney and human skeletal muscle (all cDNA libraries were made by the inventors, except for 
skeletal muscle (dT) and kidney (dT) cDNA libraries). Four 10x10 arrays of each of the 
cDNA libraries were prepared as follows: the cDNA libraries were titered to 2.5 x 10 6 using 
primary transformants. The appropriate volume of frozen stock was used to inoculate 2 L of 
15 LB/ampicillin (100 mg/ml). This inoculated liquid culture was aliquotted into 400 tubes of 4 
ml each. Each tube contained approximately 5000 cfu. The tubes were incubated at 30°C 
overnight with gentle agitation. The cultures were grown to. an OD of 0.7-0.9. Frozen stocks 
were prepared for each of the cultures by aliquotting 100 /4 of culture and 300 ^1 of 80% 
glycerol. Stocks were frozen in a dry ice/ethanol bath and stored at -70 °C. The remaining 
20 culture was DNA prepared using the Qiagen (Chatsworth, CA) spin miniprep kit according to 
the manufacturer's instructions. The DNAs from the 400 cultures were pooled to make 80 
column and row pools. The cDNA libraries were determined to contain HBM cDNA clones 
of interest by PCIL Markers were designed to amplify putative exons. Once a standard PCR. 
optimization was performed and specific cDNA libraries were determined to contain cDNA 
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clones of interest, the markers were used to screen the arrayed library. Positive addresses 
indicating the presence of cDNA clones were confirmed by a second PCR using the same 
markers. 

Once a cDNA library was identified as likely to contain cDNA clones corresponding 
5 to a specific transcript of interest from the HBM region, it was manipulated to isolate the 
clone or clones containing cDNA inserts identical to the EST or direct-selected cDNA of 
interest. This was accomplished by amodification of the standard "colony screening" 
method (Sambrookef a/., iWb/^r Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory, Cold Spring Harbor NY (1989)). Specifically, twenty 150 mm LB+ampicillin 
10 agar plates were spread with 20,000 co^ 

colonies allowed to grow overnight at 37°C. Colonies were transferred to nylon filters 
(Hybond from Amersham, or equivalent) and duplicates prepared by pressing two filters 
together essentially as described (Sambrook et al., Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor Laboratory, Cold Spring Harbor NY (1989)). The "master" plate was 
15 then incubated an additional 6-8 hours to allow the colonies to grow back. The DNA from 
the bacterial colonies was then affixed to the nylon filters by treating me filters sequentially 
with denaturing solution (0.5 N NaOH, 1.5 M NaCl) for two minutes, neutralization solution 
" (0.5MTris-ClpH8.0,1.5MNaCl)fortwommutes(twice). The bacterial colonies were 
removed from the filters by washing in a solution of 2X SSC/0.1% SDS for one minute while 
20 robbing with tissue paper. The filters were air dried and baked under vacuum at 80°C for 1-2 
hours. 

A cDNA hybridization probe was prepared by random hexamer labeling (Fineberg 
and Vogelstein, Anal. Biochem., 132:6-13 (1983)) or by including gene-specific primers and 
no random hexamers in the reaction (for small fragments). Specific activity was calculated 
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and was >5X10° cpm/10' u g of cDNA The colony Cranes were then prewashed in 10 
^MTris-ClpH 8.0, 1 MMaCl, 1 mM EDT A, 0.1% SDS for BOminutes at 55 °C. Mowing 

formamide, 2% SDS, 5X Denhardt's solution, and 100 ^ denied salmon sperm DNA, 
^ at 42°Cfor30minutes. The filters were then transferred to hybridization solution (6X SSC, 
' 2% SDS, 5X Denhardt's, 100 mg/ml denatured salmon sperm DNA) containing denatured 
a-P-dCTP-labelled cDNA probe and incubated at 42°C for 16-18 hours. 

• u ^filters were washed under constant agitation in 
After the 16-18 hour incubation, the inters were waan 

on miniites followed by two washes at 65 °C for 
2X SSC, 2% SDS at room temperature for 20 minutes, rounwc y 

, • n * v <?<?r 0 5% SDS for 15 minutes at 
10 ISnunuteseach. A second wash was performed m 0.5 X SSC, 0.5 /.MJ 

65 . C jmersweremenwrappedinplastic wrap and exposed. o radiographic film for several 
^ to overhigh.. After film development, individua! colonies on plates were aligned wrm 
^antoradiographso^^eycomd^picted^a.mlsolnnonoflBBromcon^ 

^iciUh, After shading a, 37°C for 1-2 hours, ahouots of the solution were plated on 150 
a mmplates for secondary screening. Secondary screening was identic*! to primary screening 

a ™„l^coiitainin» -250 colonies so mat individual 
(above) except that it was performed on plates contamm 0 

colonies could be clearly identified for picking. 

After colony sheening wim radiolabeled probes yielded cDNA clones, the clones 

fce fuU-lengm cDNA, me novel se^e from me end of me clone identified was used to 
probe me library again. This process was repeated until the length of the cDNA cloned 
^.ches that estimated to be fml-lengm by the norfhemblo. analysis. 
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RT-PCR was used as another method to isolate full length clones. The cDNA was 
synthesized and amplified using a "Superscript One Step RT-PCR" kit (Life Technologies, 
Gaithersburg, MD). The procedure involved adding 1.5 ug ofRNA to the following: 25 ul of 
reaction mix provided which is a proprietary buffer mix with MgS0 4 and dNTP's, 1 ul sense 
5 primer (10 uM) and 1 ul anti-sense primer (10 uM), 1 ul reverse transcriptase and Taq DNA 
polymerase mix provided and autoclaved water to a total reaction mix of 50 ul. The reaction 
was then placed in a thermocycler for i cycle at 50°C for 15 to 30 minutes, then 94°C for 15 
seconds, 55-60°C for 30 seconds and 68-72°C for 1 minute per kilobase of anticipated 
product and finally 1 cycle of 72°C for 5-10 minutes. The sample was analyzed on an 
10 agarose gel. The product was excised from the gel and purified from the gel (GeneClean, Bio 
101). The purified product was cloned in pCTNR (General Contractor DNA Cloning 
System, 5 Prime - 3 Prime, Inc.) and sequenced to verify that the clone was specific to the 
gene of interest. 

Rapid Amplification of cDNA ends (RACE) was performed following the 
15 manufacturer's instructions using a Marathon cDNA Amplification Kit (Clontech, Palo Alto, 
CA) as a method for cloning the 5' and 3' ends of candidate genes. cDNA pools were 
prepared from total RNA by performing first strand synthesis, where a sample of total RNA 
" sample was mixed with amodified oligo (dT) primer, heated to 70°C, cooled on ice and 
' followed by the addition of: 5x first strand buffer, 10 mM dNTP mix, and AMY Reverse 
20 Transcriptase (20 U/ M l). The tube was incubated at 42°C for one hour and then the reaction 
tube was placed on ice. For second strand synthesis, the following components were added 
directly to the reaction tube: 5x second strand buffer, 10 mM dNTP mi^ sterile water, 20x 
second strand enzyme cocktail and the reaction tube was incubated at 16°C for 1.5 hours. T4 
DNA Polymerase was added to the reaction tube and incubated at 16°C for 45 minutes. The 
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second-strand synthesis was terminated with the addition of an EDTA/Glycogen mix. The 
sample was subjected to a phenol/chloroform extraction and an ammonium acetate 
precipitation. The cDNA pools were checked for quality hy analyzing on an agarose gel for 
size distribution. Marathon cDNA adapters (Clontech) were then ligated onto the cDNA 
5 ends. The specific adapters contained priming sites that allowed for amplification of either 5> 
or V ends, depending on the orientation of the gene specific primer (GSP) that was chosen. 
An aliquot of the double stranded cDNA was added to the following reagents: 10 fuM 
Marathon cDNA adapter, 5x DNA ligation buffer, T4 DNA ligase. The reaction was 
incubated at 16 °C overnight. The reaction was heat inactivated to terminate the reaction. 
10 PCR was performed by the addition of the foUowing to the diluted double stranded cDNA 
pool: lOxcDNAPCRreaction buffer, 10 jjM dNTP mix, 10 piM GSP, 10^M API primer 
(kit), 50x Advantage cDNA Polymerase Mix. Thermal Cycling conditions were 94°C for 30 
seconds, 5 cycles of 94°C for 5 seconds, 72°C for 4 minutes, 5 cycles of 94°C for 5 seconds, 
'70»C for 4 minutes, 23 cycles of94°Cfor5 seconds, 68° C for 4 minutes. Afterthe first 
15 round of PCR was performed using the GSP.to extend to the end of the adapter to create the 
adapter primer binding site, exponential amplification of the specific cDNA of interest was 
observed. Usually a second nested PCR is performed to confirm the specific cDNA. The 
" RACE product was analyzed on an agarose gel and then excised and purified from the gel 
(GeneClean,BIO101). The RACE product was then cloned into pCTNR (General 
20 Contractor DNA Cloning System, 5' - 3', Inc.) and the DNA sequence determined to verify 
that the clone is specific to the gene of interest. 
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XI. Mutation Analysis 

Comparative genes were identified using the above procedures and the exons from 
each gene were subjected to mutation detection analysis. Comparative DNA sequencing was 
used to identify polymorphisms inHBM candidate genes from chromosome llql2-13. DNA 
5 sequences for candidate genes were amplified from patient lymphoblastoid cell lines. 

The inventors developed a method based on analysis of direct DNA sequencing of 
PCR products amplified from candidate regions to search for the causative polymorphism. 
The procedure consisted of three stages that used different subsets of HBM family to find 
segregating polymorphisms and a population panel to assess the frequency of the 
10 polymorphisms. The family resources result from a single founder leading to the assumption 
that all affected individuals will share the same causative polymorphism. 

Candidate regions were first screened in a subset of the HBM family consisting of the 
proband, daughter, and her mother, father and brother. Monochromosomal reference 
sequences were produced concurrently and used for comparison. The mother and daughter 
15 carried the HBM polymorphism in this nuclear family, providing the ability to monitor 
polymorphism transmission. The net result is that two HBM chromosomes and six non- 
HBM chromosomes were screened. This allowed exclusion, of numerous frequent alleles. 
*' Only alleles exclusively present in the affected individuals passed to the next level of 
analysis. 

20 Polymorphisms that segregated exclusively with the HBM phenotype in this original 

family were then re-exarnined in an extended portion of the HBM pedigree consisting of two 
additional nuclear families. These families consisted of five HBM and three unaffected 
individuals. The HBM individuals in this group included the two critical crossover 
individuals, providing the centromeric and telomeric boundaries of the critical region. 
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Tracking the heredity of polymorphisms between these individuals and their affected parents 
allowed for further refining of the critical region. This group brought the total of HBM 
chromosomes screened to seven and the total of non-HBM chromosomes to seventeen. 
When a given polymorphism continued to segregate exclusively with the HBM 
5 phenotype in the extended group, a population panel was men examined. This panel of 84 
persons consisted of 42 individuals known to have normal bone mineral density and 42 
individuals known to be unrelated but with untyped bone mineral density. Normal bone 
mineral density is within two standard deviations of BMD Z score 0. The second group was 
ftom the widely used CEPH panel of individuals. Any segregating polymorphisms found to 
«. be rare in this population were subsequently examined on the entire HBM pedigree and a 
larger population. 

Polymerase chain reaction (PGR) was used to generate sequencing templates from the 
HBMfamfly'sDNAandmonochromosomalcontrols. Enzymatic amplification of genes- 
v«e HBM region on H ql2-13 was acco^^^ 
15 flanking each exon as well as the putative S regulatory elements of each gene. Theprimers 
• were chosen to amplify each exon as well as 15 or more base pairs within each intron on 
either side of the splice. All PCR primers were made as chimeras to facilitate dye primer 
• sequencing. The Ml 3-21F (5'- GTA A CGA CGG CCA GT -3') (SEQ ID NO:30) and - 
28REV (*. AAC AGC TAT GAC CAT G -30 (SEQ ID NO:31) primer binding sites were 
20 built on to the 5' end of each forward and reverse PCR primer, respectively, during synthesis. 
1 50 ng of genomic DNA was used in a 50 ul PCR with 2UAmpliTaq, 500 nM primer and 
125 uM dNTP. Buffer and cycling conditions were specific to each primer set. TaqStart 
antibody (Clontech) was used for hot start PCR to mmimize primer dimer formation. 10% of 
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the product was examined on an agarose gel. The appropriate samples were diluted 1:25 with 

deionized water before sequencing. 

Each PCR product was sequenced according to the standard Energy Transfer primer 
(Amersham) protocol. All reactions took place in 96 well trays. 4 separate reactions, one 
5 each for A, C, G and T were performed for each template. Each reaction included 2 ul of the 
sequencing reaction mix and 3 ul of diluted template. The plates were then heat sealed with - 
foil tape and placed in a thermal cycler and cycled according to the manufacturer's 
recommendation. After cycling, the 4 reactions were pooled. 3 ul of the pooled product was 
transferred to anew 96 well plate and 1 ul of the manufacturer's loading dye was added to 
10 each well. All 96 well pipetting procedures occurred on a Hydra 96 pipetting station 

(Robbins Scientific, USA). 1 ul of pooled material was directly loaded onto a 48 lane gel 
ruiming on an ABI 377 DNA sequencer for a 10 hour, 2.4 kV run. 

• Polyphred (University of Washington) was used to assemble sequence sets for 
viewing with Consed (University of Washington). Sequences were assembled in groups 
15 representing all relevant family members and controls for a specified target region. This was 
done separately for each of the three stages. Forward and reverse reads were included for 
each individual along with reads from the monochromosomal templates and a color annotated 
" reference sequence. Polyphred indicated potential polymorphic sites with a purple flag. Two 
readers independently viewed each assembly and assessed the validity of the purple-flagged 
20 sites. 

A total of 23 exons present in the mature mRNA and several other portions of the 
primary transcript were evaluated for heterozygosity in the nuclear family of two HBM- 
affected and two unaffected individuals. Twenty-five single nucleotide polymorphisms 
(SNPs) were identified, as shown in the table below. 
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TABLE 4: Single Nucleotide Polymorphisms in the Zmaxl gene 



10 



15 



20' 



25 



Exon Name 
b 200e21-h_Contigl_l .at 
b200e21-b._Contig4_12.nt 
b200e21-h_Contig4_13.pt 
b200e21-h_Contig4__l6.nt 

b200e21-h_ Contig4_21 .nt 
b200e21-h_Contig4_22.nt-a 
b200e21-h_ Contig4_22.nt-b 
b200e21-h Contig4_22.nt-c 
b200e21-h_ Contig4_24.nt-a 
b200e21-h Contig4_24.nt-b 
b200e21-h Contig4_25Jit 
b200e21-h Contig4_33.nt-a 
b200e21-h Contig4_33.nt-b 
b200e21-h Contig4_61.nt 
b200e21-h Comjg4_62.nt 
b200e21-h. Contig4_63.nt-a 
b200e21-h Contig4_63.nt-b 
V.?.00e21-h Contig4_9.iit 
KS'TMia-h Contig030g_lJit-a 
T^r7rM2-h Contig030g_l-nt-b 
T^7-7<m-h Contig080C_2.nt 
KQ7H12-h Contig087C_lJit 
i^77H12-h Contig087C_4.nt 
i^T7Hl2-h Contig088C_4.nt 
w;77rn?-h Contis089_lHPJt 



Location 
69169 (309G) 
27402 (309G) 
27841 (309G) 
35600 (309G) 
45619 (309G) 
46018 (309G) 
46093 (309G) 
46190 (309G) 
50993 (309G) 
51124 (309G) 
55461 (309G) 

63645 (?09G) 

63646 (309G) 
24809 (309G) 
27837 (309G) 
31485 (309G1 
31683 (309G) 

24808^309(5). 
31340 (308G^ 
32538 (308G) 
13224 (308G) 
21119 (308G) 
30497 (308G) 
24811 (309G) 
68280 (309G) 



and Environs 

Bas e Cha nge 
C/A 
A/G 
T/C 
A/G 
G/A 
T/G 
T/G 
A/G 
T/C 
C/T 
C/T 
C/A 
A/C 
T/G 
T/C 
C/T 
A/G 
T/G 



-72- 



WO 01/92891 ' PCT/USO 1/16946 



In addition to the polymorpnisms presented in TaDie 4, two additional 
polymorphisms can also be present in SEQ ID NO:2. These is a change at position 2002 of 
SEQ ID NO:2. Either a guanine or an adenine can appear at this position. This 
polymorphism is silent and is not associated with any change in the amino acid sequence. 
5 The second change is at position 4059 of SEQ ID NO:2 corresponding in a cytosine (C) to 
thymine (T) change. This polymorphism results in a corresponding amino acid change from 
a valine (V) to an alanine (A). Other polymorphisms were found in the candidate gene exons 
and adjacent intron sequences. Any one or combination of the polymorphisms listed in Table 
4 or the two discussed above could also have a minor effect on bone mass or lipid levels 
10 when present in SEQ ID NO:2. 

The present invention encompasses the nucleic acid sequences having the nucleic acid 
sequence of SEQ ID NO: 1 with the above-identified point mutations. 

Preferably, the present invention encompasses the nucleic acid of SEQ ID NO: 2. 
Specifically, a base-pair substitution changing G to T at position 582 in the coding sequence 
15 of Zmaxl (the HBM gene) was identified as heterozygous in all HBM individuals, and not 
found in the unaffected individuals (i.e., b527dl2-h_Conng087C_l.nt). Fig. 5 shows the 
order of the contigs inB527D12. The direction of transcription for the HBM gene is from left 
to right The sequence of contig308G of B527D12 is the reverse complement of the coding 
region to the HBM gene. Therefore, the relative polymorphism in contig 308G shown in 
20 Table 4 as a base change substitution of C to A is the complement to the G to T substitution 
in the HBM gene. This mutation causes a substitution of glycine 171 with valine (G171V). 

The HBM polymorphism was confirmed by examining the DNA sequence of different 
groups of individuals. In all members of the HBM pedigree (3 8 individuals), the HBM 
polymorphism was observed in the heterozygous form in affected (i.e., elevated bone mass) 
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individuals only (N-1S). h^Md relanves (N=20) (BMDZ^.O) the HBM 
po.vmorphism was never observed. To determine whether this gene was ever observed » 

.hesitcof^^gen. None were heterozygous at the site of the HBM polymorphism. In 
5 anmrphenotypedcontro.group.1 of« individuals was observed to be heterozygous at 
p osition582. Smoetmsmdividualisdeceaseimeirbonemrncraldensivcouldnotbe 
obtained. Tatentogemer, these data prove that me polymorphism observed in thekindred 
paying me high bone mass phenol is strongly correlated with the <WT polymorphism 
atpositionSSiofZmaxl.'Taken.ogemer.meseresultses^blishthatmeHBM 
10 polymorpmsmgen^ysegregateswimmeHBMphenotype.andmatbommeHBM 

polymorphism and phenotype are rare in the general population. 

XH Allele Specific Oligonucleotide (ASO) Analysis 

The amplkon containing the HBM1 poly^norphism wasPCR amplifiedusing primers 
specific forme exon of interest The appropriate population of individuals was PCR 
15 am P U fi edm96weUmicro«i.erpl,esasfoUows. PCRreaetions (20 ul) containing IX 
PromegaPCRbuffer (Cat #M1883 confining 1.5 -MMyM. ^ dNTP, 200 nM 
' PCRprimers (1863P: CCAAGTTCTGAGAAGTCC and X864R: AATACCTGAAACCAT 
ACCTG) X O Amplitac, and20 ng of genomic DNA were prepared and amplified under the 
foUowingPCRconditions: 9*1 1 minute, (94-C, 30 sec, 58* 30 sec, 7*1 1 min.) X35 
20 cycles), W , 5.,^ hold. Loading dye was then added and 10 ul of the products was 

S-XOminutes. Gelsweretrea,ed20mi n ntcs fa denatn ri ngsclunon(1.5MNaa0. 5 H 
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1 .5 M NaCl, for 20 minutes and rinsed with water. Uels were soaked in 1 0 X fciiiU tor ZU 
minutes and blotted onto nylon transfer membrane (Hybond N+- Amersham) in 10X SSC 
overnight. Filters were the rinsed in 6X SSC for 10 rninutes and UV crosslinked. 

The allele specific oligonucleotides (ASO) were designed with the polymorphism 
5 approximately in the middle. Oligonucleotides were phosphate free at the 5 'end and were 
purchased from Gibco BRL. Sequences of the oligonucleotides are: 
2326 Zmaxl-ASO.g: agactggggtgagacgc 
2327Zmaxl.ASO.t: cagactgggxtgagacgcc 
The polymorphic nucleotides are underlined. To label the oligos, 1 .5 ul of 1 jig/ul ASO 
10 oligo (2326.Zmaxl.ASO.g or 2327.Zmaxl.ASO.t), 11 ul ddH 2 O s 2 ul 10X kinase forward 
buffer, 5 ul y- 32 P-ATP (6000 Ci/rnMole), and 1 p.1 T4 polynucleotide kinase (10 U/jxl) were 
mixed, and the reaction incubated at 37°C for 30-60 minutes. Reactions were then placed at 
95°C for 2 minutes and 30 ml H 2 0 was added. The probes were purified using a G25 
microspin column (Pharmacia). 
15 Blots were prehybridized in 10 ml 5X SSPE, 5X Denhardt's, 2% SDS, and 100 ng/ml, 

denatured, sonicated salmon sperm DNA at 40°C for 2 hr. The entire reaction mix of kinased 
oligo was then added to 10 ml fresh hybridization buffer (5X SSPE, 5X Denhardts, 2% SDS) and 
hybridized at 40°C for at least 4 hours to overnight 

All washes done in 5X SSPE, 0.1 % SDS. The first wash was at 45°C for 15 minutes; 
20 the solution was then changed and the filters washed 50°C for 15 minutes. Filters were then 
exposed to Kodak biomax film with 2 intensifying screens at -70°C for 15 minutes to 1 hr. If 
necessary the filters were washed at 55°C for 15 rninutes and exposed to film again. Filters 
were stripped by washing in boiling 0.1X SSC, 0.1% SDS for 10 minutes at least 3 times. 
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The two films that best captured the allele specific assay with the 2 ASOs were converted 
into digital images by scaiming them into Adobe PhotoShop. These images were overlaid 
against each other in Graphic Converter and.then scored and stored in FileMaker Pro 4.0 (see 
Fig. 9). 

5 XEIL Cellular Localization of Zmaxl 

A. Gene Expression in Rat tibia by non isotopic In Situ Hybridization 

In situ hybridization was conducted by Pathology Associates International (PAI), 
Frederick, MD. This study was undertaken to determine the specific cell types that express 
• the Zmaxl gene in rat bone with particular emphasis on areas of bone growth and remodeling. 
10 Zmaxl probesusedinthisstudy were generated from both human (HuZmaxl) and mouse 
(MsZmaxl) cDNAs, which share an 87% sequence identity. The homology of human and 
mouse Zmaxl with rat Zmaxl is unknown. 

For example, gene expression by non-isotopic in situ hybridization was performed as 
follows, but other methods would be known to the skilled artisan. Tibias were collected from 
15 two 6 to 8 week old female Sprague Dawley rats euthanized by carbon dioxide asphyxiation. 
Distal ends were removed and proximal tibias were snap frozen in OCT embedding medium 
with liquid nitrogen immediately foEowing death. Tissues were stored in a -80'C freezer. 

Probes for amplifying PCR products from cDNA were prepared as follows. The 
primers to amplify PCR products from a cDNA clone were chosen using published sequences 
20 of both human LRP5 (Genbank Accession No. AB017498) and mouse LRP5 (Genbank 
Accession No. AF064984). In order to rninimize cross reactivity with other genes in the 
LDL receptor family! the PCR products were derived from an intracellular portion of me 
' protein coding region. PCR was performed in a 50 ul reaction volume using cDNA clone as 



-76- 



WO. 01/92891 



PCT/US01/16946 



template. 1>UK. reactions contained l.i mM MgCl^ 1 unit Amplitaq, 200 dNl'JJs and 2 
\xM each primer. PCR cycling conditions were 94°C for 1 min., followed by 35 cycles of 
94°C for 30 seconds, 55°C for 30 seconds, 72°C for 30 seconds; followed by a 5 minute 
extension at 72°C. The reactions were then run on a 1.5 % agarose Tris-Acetate gel. DNA 
5 was eluted from the agarose, ethanol precipitated and resuspended in 10 mM Tris, pH 8.0. 
Gel purified PCR products were prepared for both mouse and human cDNAs and supplied to 
Pathology Associates International for in situ hybridizations. 

The sequence of the human and mouse PCR primers and products were as follows: 
Human Zmaxl sense primer (HBM1253> 

10 CCCGTGTGCTCCGCCGCCCAGTTC 

Human Zmaxl antisense primer fHBM1465^ 

GGCTCACGGAGCTCATCATGGACTT 

Human Zmaxl PCR product 

CCCGTGTGCTCCGCCGCCCAGTTCCCCTGCGCGCGGGGTCAGTGTGTGGACCrGCGCCTGCGCTGCGACGGCGAG 

15 GCAGACTGTCAGGACCGCTCAGA^ 

GGCCAGTGTGTCCTCATCAAACAGCAGTC 

GAAA.TCACCAAGCCGCCCTCAGACGACAGCCCGGCCCACAGCAGTGCCATCGGGCCCG 
TCTCXCTTOGTCATGGG^ 

CCCTTCCCGCACGAGTATGTCAGCGGGACCCCGCACGTGCCCCT 
20 GGCCCCTTCACAGGCATCGCATGCGGAAAGTCCATGA.TGAGCTCCGTGAGCC 

Mouse Zmaxl Sense primer fHBM16551 

AGCGAGGCCACCATC CACAGG 

Mouse Zmaxl antisense primer flHBMieSfr) 

TCGCTGGTCGGCATAATCAAT 
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AGCAGAGC CA _~,raTCTACTGGACTGATGTTAGCCrrCA&GACGATCAGC 
GAGGCCTCTGCACTGGACTTTGATGTGTCCAACAATCACATCTACTGGACT 

GAGGCCTt- _^p arT TTGGCCTCGACTACCCTGAAGGAATGGCTGTG 
CGAGCCTTCATGAATGGGAGCTCAGTGGAGCACGTGATTGAGTTTGGCCTCGA 

CGAGCCTTCAT . ^^^gcccGGCTGGATGGGCAG 

5 gactggatgggcaagaacctctattgggcggacacagggaccaacag 

Biboprobes were synthesized 3, Mow, ThePCR product were rean^ed with 
M ohhneric printers designed - top— — a 13 proper »ps*eam, or . T7 pronroter 
ao^of^^plific^onprod^The^PCRpro.uc.swc.e^as 

templ ate to ^ digoxigenin-labe.ed riboprobes by ft. transcription <WT>. 
Anusense and sense riboprobes were synced using T7 and T3 SNA poiymerases, 
rap ecuvely>thepresenceofdigo*^ 
15 M^mttCA^^**"— rer. The DNA was then degraded 
^Dnase-X.and^corpcrateddigoxigeninwas — by— anon. Kiboprobe 
tategrity was assessedbyeiec.rophoresU.taongbadenamringpolyaorytanndege!. 

^Udder (Atnbion). Probe yieid and deling was evaded by b,o, inununochenus*y. 

20 Wboprobeswerestoredin5ulanquotsat-80-C, ^ 

rne I - B ^hybridizationwasp= rf or m edasfonows.Frozen r a,bonewascu. m to5 

m sections on a Jung CM3000 cryosta, (Leiea) and counted on adhesive slides 
medics, Sec^were.eptinthecryos^at^C^laU.eshde, were prepared 

inolde r to preven t n^degrada n onprior,opos,-nxanonforl5nnnutesin4% 
25 pa—dehyde. Mowing post-fixation, secuons were incubated of erfcer 
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antisense or sense riboprobe in Pathology Associates International (PAT) customized 
hybridization buffer for approximately 40 hours at 58°C. Following hybridization, sHdes 
were subjected to a series of post-hybridization stringency washes to reduce nonspecific 
probe binding. Hybridization was visualized by imm unohistochemistry with an anti- 
5 digoxigenin antibody (FAB fragment) conjugated to alkaline phosphatase. Nitroblue 

tetrazolium chloride/bromochloroindolyl phosphate (Boehringer-Mannheim), a precipitating 
alkaline phosphatase substrate, was used as the chromogen to stain hybridizing cells puiple to 
nearly black, depending on the degree of staining. Tissue sections were counter-stained with 
nuclear fast red. Assay controls included omission of the probe, omission of probe and anti- 
10 digoxigenin antibody. 

Specific cell types were assessed for demonstration of hybridization with antisense 
probes by visualising a purple to black cytoplasmic and/or peri-nuclear staining indicating a 
positive hybridization signal for mRNA. Each cell type was compared to the replicate 
sections, which were hybridized with the respective sense probe. Results were considered 
15 positive if staining was observed with the antisense probe and no staining or weak 
background with the sense probe. 

The cellular localization of the hybridization signal for each of the study probes is 
summarized in Table 5. Hybridization for Zmaxl was primarily detected in areas of bone 
involved in remodeling, including the endosteum and trabecular bone within the metaphysis. 
20 Hybridization in selected bone lining cells of the periosteum and epiphysis were also 
observed. Positive signal was also noted in chondrocytes within the growth plate, 
particularly in the proliferating chondrocytes. See Figs. 10, 11 and 12 for representative 
photomicrographs of in situ hybridization results. 
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TABLE 5 

Summary of Zmaxl in situ hybridization in rat tibia 



TO T-fc f~\ Y"> n 

JrROJBE 


Site 


ISH SIGNAL 


Hu Zmaxl 


Epiphysis 






Osteoblasts 


+ 




Osteoclasts 






Growth Plate 






resting chondrocytes 


- 




proliferating chondrocytes 


+ 




hypertrophic chondrocytes 


- 




Metanhvsis 






osteoblasts 






osteoclasts 


+ 




Diaphysis 


- 




Endosteum 






osteoblasts 


+ 




osteoclasts 






Periosteum 


- 














MsZmaxl 


Epiphysis 






Osteoblasts 


+ 




Osteoclasts 


- 




Growth Plate 






resting chondrocytes 






proliferating chondrocytes ■ *" 






hypertrophic chondrocytes 






Metaphvsis 






osteoblasts 






osteoclasts 






Diaohvsis 






Endosteum 






osteoblasts 






osteoclasts 






Periosteum 


+ 









Legend: "+" = hybridization signal detected "-" = no hybridization signal detected 
"ISH" -In situ hybridization 
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These studies confirm the positional expression of Zmaxl* in cells involved in bone 
remodeling and bone formation. Zmaxl expression in the zone of proliferation and in the 
osteoblasts and osteoclastsof the proximal metaphysis, suggests that the Zmaxl gene is 
involved in the process of bone growth and mineralization. The activity and differentiation of 

5 osteoblasts and osteoclasts are closely coordinated during development as bone is formed and 
during growth as well as in adult life as bone undergoes continuous remodeling. The 
formation of internal bone structures and bone remodeling result from the coupling of bone 
resorption by activated osteoclasts with subsequent deposition of new material by osteoblasts. 
Zmaxl is related to the LDL receptor gene, and thus may be a receptor involved in 

10 mechanosensation and subsequent signaling in the process of bone remodeling. Therefore, 
changes in the level of expression of this gene, could impact on the rate of remodeling and 
degree of mineralization of bone. Similar studies can be designed for in situ analysis of 
HBM or Zmaxl in other cells or tissues. 

XTV. Antisense 

15 Antisense oligonucleotides are short synthetic nucleic acids that contain 

complementary base sequences to a targeted RNA. Hybridization of the RNA in .living cells 
with the antisense oligonucleotide interferes with RNA function and ultimately blocks protein 
expression. Therefore, any gene for which the partial sequence is known can be targeted by 
an antisense oligonucleotide. 

20 Antisense technology is becoming a widely used research tool and will play an 

increasingly important role in the validation and elucidation of therapeutic targets identified 
by genomic sequencing efforts. 
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Anhsense technology was developed t0 ^ gene by ^ ^ 

ohgonudeotide complementary tQ ^ ^ ^ ^ ^ ^ ^ ^ sevml 

degradation of m*N A byENMeHis _ ider=d (o ofa3ibaM ^ 

proteins. TMst^^^^^^^^^ 

gene, bu, may also have therapeutic appucan 0 ns, „ „ ^ ^ 

properly. 

• A, example of materkb md mrthods fa prq)aiiag ^ oI . gonucbot . des ^ ^ 
performed 33 Mows. ^ ^ bMn fa ^ 

10 Setter (Haact MA) using me amisense .echno.ogy b fte ^ ^ ^ 

MC3T3. ^c*. aMtgIedtoiwtlT ^ ttaifciMB ^ 

m *»»*«d is character^ 

-Q-edfor^uenttoduodonofdiff^Mo,,^ . Once the maM* syndesis 
* begins, osteoUastmarlcergenes are acnvatedta aclearternporal sequence: ^ 

phosphatase is induced at earlv times wm.t,™ • i 

early times while bone sialopronen and osteocalcin appear later in 

the differennaSonprocess. This temporal sequence of gene expression is useful in 
storing thematnranon and.ninera^on^ 

begm una several days after inatnration has started, involves deposition of mmera, on and 

collagen fibril-associated mineral formed by cultured osteoblast resembles that found in 
woven bone m vivo and therefore is used fteoucntly as a sntdy reagent 

MC3T3 cells were transfected with antisense oligouucleoudes for the firs, week of the 
differentiation, according «o the manufacturer's specifications (US. Paten, No. 5,849 902) 
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The oligonucleotides designed lor Zmaxl are given below: 
10875: AGUACAGCUUCUUGCCAACCCAGUC 
10876: UCCUCCAGGUCGAUGGUCAGCCCAU 
10877: GUCUGAGUCCGAGUUCAAAUCCAGG 
5 Figure 13 shows the results of antisense inhibition of Zmaxl in MC3T3 cells. The three 

oligonucleotides shown above were transfected into MC3T3 and RNA was isolated according 
to standard procedures. Northern analysis clearly shows markedly lower steady state levels 
of the Zmaxl transcript while the control gene GAPDH remained unchanged. Thus, 
antisense technology using the primers described above allows for the study of the role of 

10 Zmaxl expression on bone biology. Similar primers can be used to study Zmaxl expression 
and its ability to regulate lipid levels in an animal. 

The protein encoded by Zmaxl is related to the Low Density Lipoprotein receptor 
(LDL receptor). See, Goldstein et al, Ann. Rev. Cell Biology, 1:1-39 (1985); Browne* al, 
Science, 232:34-47 (1986). The LDL receptor is responsible for uptake of low density 

15 lipoprotein, a lipid-protein aggregate that includes cholesterol. Individuals with a defect in 
the LDL receptor are deficient in cholesterol removal and tend to develop atherosclerosis. In 
addition, cells with a defective LDL receptor show increased production of cholesterol, in 
part because of altered feedback regulation of cholesterol synthetic enzymes and in part 
because of increased transcription of the genes for these enzymes. In some cell types, 

20 cholesterol is a precursor for the formation of steroid hormones. 

Thus, the LDL receptor may, directly or indirectly, function as a signal transduction 
protein and may regulate gene expression. Because Zmaxl is related to the LDL receptor, 
this protein may also be involved in signaling between cells in a way that affects bone 
remodeling as well as regulate lipid levels and therefore lipid-mediated diseases. 
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'me glycine 171 ammo acid is Wcely to be important tor the tunction of Zmaxl 
because this amino acid is also found in the mouse homologue of Zmaxl. The closely related 
LRP6 protein also contains glycine at the corresponding position (Brown et aL 9 Biochemical 
and Biophysical Research Comm., 248:879-888 (1988)). Amino acids that are important in a 
5 protein's structure or function tend to he conserved between species, because natural selection 
prevents mutations with altered amino acids at important positions from arising. 

In addition, the extracellular domain of Zmaxl contains four repeats consisting of five 
YWT motifs followed by an EFG motif. This 5YWT+EGF repeat is likely to form a distinct 
folded protein domain, as this repeat is also found in the LDL receptor and other LDL 

10 receptor-related proteins. The first three 5YWT+EGF repeats are very similar in their 

structure, while the fourth is highly divergent. Glycine 171 occurs in the central YWT motif 
of the first 5YWT+EGF repeat in Zmaxl. The other two similar 5YWT+EGF repeats of 
Zmaxl also contain glycine at the corresponding position, as does the 5YWT+EGF repeat in 
the LDL receptor protein. However, only 17.6% of the amino acids are identical among the 

15 first three 5YWT+EGF repeats in Zmaxl and the single repeat in the LDL receptor. These 
observations indicate that glycine 171 is essential to the function of this repeat, and mutation 
of glycine 171 causes a functional alteration of Zmaxl. The cDNA and peptide sequences are 
shown in Figs. 6A-6E. The critical base at nucleotide position 582 is indicated in bold and is 
underlined. 

20 Northern blot analysis (Figs, 7A-B) reveals that Zmaxl is expressed in human bone 

tissue as well as numerous other tissues. A multiple-tissue Northern blot (Clontech, Palo 
' Alto, CA) was probed with exons from Zmaxl . As shown in Fig. 7 A, the 5.5 kb Zmaxl 
transcript was highly expressed in heart, kidney, lung, liver and pancreas and is expressed at . 
lower levels in skeletal muscle and brain. A second northern blot, shown in Fig. 7B, 
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confirmed the transcript size at 5.5 kb, and indicated that Zmaxl is expressed in bone, bone 
marrow, calvaria and human, osteoblastic cell lines. 

Taken together, these results indicate that the HBM polymorphism in the Zmaxl gene 
is responsible for the HBM phenotype, and that the Zmaxl gene is important in bone 
5 development In addition, because mutation of Zmaxl can alter bone mineralization and 
development as well as lipid levels, it is likely that molecules that bind to Zmaxl may 
usefully alter bone development and lipid levels. Such molecules may include, for example, 
small molecules, proteins, RNA aptamers, peptide aptamers, and the like. 

XV. Preparation of Nucleic Acids, Vectors, Transformations and Host Cells 
10 " Large amounts of the nucleic acids of the present invention may be produced by 

replication in a suitable host cell. Natural or synthetic nucleic acid fragments coding for a 
desired fragment will be incorporated into recombinant nucleic acid constructs, usually DNA 
constructs, capable of introduction into and replication in aprokaryotic or eukaryotic cell. 
Usually the nucleic acid constructs will be suitable for replication in a unicellular host, such 
15 as yeast or bacteria, but may also be intended for introduction to (with and without 

integration within the genome) cultured mammalian or plant or other eukaryotic cell lines. 
The purification of nucleic acids produced by the methods of the present invention is 
described, for example, in Sambrook et al, Molecular Cloning. A Laboratory Manual, 2nd 
Ed. (Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989) or Ausubel et al, 
20 Current Protocols in Molecular Biology, J. Wiley and Sons, NY (1992). 

The nucleic acids of the present invention may also .be produced by chemical 
synthesis, e.g., by the phosphoramidite method described by Beaucage et al., Tetra. Letts., 
22:1859-1862 (1981) or the triester method according to Matteucci, et al., J. Am. Chem. Soc, 
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103:3185 (1981), and may be performed on commercial, automated oligonucleotide 
synthesizers. A double-stranded fragment may be obtained from the single-stranded product 
of chemical synthesis either by synthesizing the complementary strand and annealing the 
strands together under appropriate conditions or by adding the complementary strand using 
5 DNA polymerase with an appropriate primer sequence. 

Nucleic acid constructs prepared for introduction into a prokaryotic or eukaryotic host 
may comprise a replication system recognized by the host, including the intended nucleic 
acid fragment encoding the desired protein, and will preferably also include transcription and 
transitional initiation regulatory sequences operably linked to the protein encoding segment. 
10 Expression vectors may include, for example, an origin of replication or autonomously 
replicating sequence (ARS) and expression control sequences/a promoter, an enhancer and 
necessary processing information sites, such as ribosome-binding sites, RNA splice sites, 
polyadenylation sites, transcriptional terminator sequences, and mRNA stabilizing sequences. 
Secretion signals may also be included where appropriate, whether from a native HBM or 
15 Zmaxl protein or from other receptors or from secreted proteins of the same or related 

species, which allow the protein to cross and/or lodge in cell membranes, and thus attain its 
functional topology, or be secreted from the cell. Such vectors may be prepared by means of 
standard recombinant techniques well known in the art and discussed, for example, in 
Sambrook et al. , Molecular Cloning. A Laboratory Manual, 2nd Ed. (Cold Spring Harbor 
20 Laboratory, Cold Spring Harbor, NY (1989) or Ausubel et al, Current Protocols in 
Molecular Biology, J. Wiley and Sons, NY (1992). 

An appropriate promoter and otier necessary vector sequences will be selected so as 

• tobefunctionalhithehost,a^^ 
with Zmaxl orfiBMgenes. Examples of workable combinations of cell lines and expression 
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vectors are described in Sambrook etal, Molecular Cloning. A Laboratory Manual, 2nd Hd. 
(Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989) or Ausubel et al., Current 
Protocols in Molecular Biology, J. Wiley and Sons, NY (1992). Many useful vectors are 
known in the art and may be obtained from such vendors as Stratagene, New England 
5 BioLabs, Promega Biotech, and others. Promoters such as the tip, lac and phage promoters, 
tRNA promoters and glycolytic enzyme promoters may be used in prokaryotic hosts. Useful 
yeast promoters include promoter regions for inetallothionein, 3-phosphoglycerate kinase or 
other glycolytic enzymes such as enolase or glyceraldehyde-3-phosphate dehydrogenase, 
enzymes responsible for maltose and galactose utilization, and others. Vectors and promoters 
10 suitable for use in yeast expression are further described in EP 73,675A. Appropriate non- 
native mammalian promoters might include the early and late promoters from SV40 (Fiers et 
al, Nature, 273:1 13 (1978)) or promoters derived from murine Moloney leukemia virus, 
mouse tumor virus, avian sarcoma viruses, adenovirus H, bovine papilloma virus or polyoma. 
In addition, the construct may be joined to an amphfiable gene (e.g., DHFR) so that multiple 
15 copies of the gene may be made. For appropriate enhancer and other expression control 
sequences, see also Enhancers and Eukaryotic Gene Expression, Cold Spring Harbor Press, 
Cold Spring Harbor, NY (1983). 

While such expression vectors may replicate autonomously, they may also replicate 
by being inserted into the genome of the host cell, by methods well known in the art. 
20 Expression and cloning vectors will likely contain a selectable marker, a gene 

encoding a protein necessary for survival or growth of a host cell transformed with the Vector. 
The presence of this gene ensures growth of only those host cells which express the inserts. 
Typical selection genes encode proteins that a) confer resistance to antibiotics or other toxic 
substances, e.g. ampicillin, neomycin, methotrexate, etc.; b) complement auxotrophic 
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aenciencies, or c, suppiy cnucaj numenis not avaiiame nom complex meaia, e.g., ine gene 
encoding D-alanine racemase for Bacilli. The choice of the proper selectable marker will 
depend on the host cell, and appropriate markers for different hosts are well known in the art 
The vectors containing the nucleic acids of interest can be transcribed in vitro, and the 

5 resulting l^Aintrodu^ 

Kubo et al, FEBS Letts. 241:1 19 (1988)), or the vectors can be introduced directly into host 
cells by methods well known in the art, which vary depending on the type of cellular host, 
including electroporation; transfection employing calcium chloride, rubidium chloride, 
calcium phosphate, DEAE-dextran, or other substances; microprojectile bombardment; 

10 lipofection; infection (where the vector is an infectious agent, such as a retroviral genome); 
and other methods. See generally, Sambrook etal, 1989 and Ausubel et ah, 1992. The 
introduction of the nucleic acids into the host cell by any method known in the art, including 
those described above, will be referred to herein as "transformation." The cells into which 
have been introduced nucleic acids described above are meant to also include the progeny of 

15 such cells. 

Large quantities of the nucleic acids and proteins of the present invention may be 
prepared by expressing the Zmaxl or HBM nucleic acids or portions thereof in vectors or 
other expression vehicles in compatible prokaryotic or eukaryotic host cells. The.most 
commonly used prokaryotic hosts are strains of Escherichia colU ^though other prokaryotes, 
20 such as Bacillus subtilis or Pseudomonas may also be used. 

Mammalian or other eukaryotic host cells, such as those of yeast, filamentous fungi, 
plant, insect, or amphibian or avian species, may also be useful for production of the proteins 
of the present invention. Propagation of mammalian cells in culture is per se well known. 
See, Jakoby and Pastan (eds.), Cell Culture. Methods in Enzymology, volume 58, Academic 
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Press, Inc., Harcourt Braes Jovanovich, NY, (1979)). Examples of commonly used 
mammalian host cell lines are VERO and HeLa cells, Chinese hamster ovary (CHO) cells, 
and WI38, BHK, and COS cell lines, although it will be appreciated by the skilled 
practitioner that other cell lines may be appropriate, e.g., to provide higher expression 
5 desirable glycosylation patterns, or other features. 

Clones are selected by using markers depending on the mode of the vector 
construction. The marker may be on the same or a different DNA molecule, preferably the 
same DNA molecule. Li prokaryotic hosts, the transformant may be selected, e.g., by 
resistance to ampicillin, tetracycline or other antibiotics. Production of a particular product 
10 based on temperature sensitivity may also serve as an appropriate marker. 

Prokaryotic or eukaryotic cells transformed with the nucleic acids of the present 
invention will be useful not only for the production of the nucleic acids and proteins of the 
present invention, but also, for example, in studying the characteristics of Zmaxl or HBM 
proteins* 

15 Antisense nucleic acid sequences arc useful in preventing or diminishing the 

expression of Zmaxl or HBM, as will be appreciated by one skilled in the art. For example, 
nucleic acid vectors containing all or a portion of the Zmaxl or HBM gene or other sequences 
from the Zmaxl or HBM region may be placed under the control of a promoter in an 
antisense orientation and introduced into a cell. Expression of such an antisense construct 

20 within a cell will interfere with Zmaxl or HBM transcription and/or translation and/or 
replication. 

The probes and primers based on the Zmaxl and HBM gene sequences disclosed 
herein are used to identify homologous Zmaxl and HBM gene sequences and proteins in 
other species. These Zmaxl and HBM gene sequences and proteins are used in the 
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' diagnostic/prognostic, therapeutic and drug screening methods described herein for the 
species from which they have been isolated. 

XVI. Protein Expression and Purification 

Expression and purification of the HBM protein of the invention can be performed 
5 essentially as outlined below. To facilitate the cloning, expression and purification of 
membrane and secreted protein from the i£5A/gene, a gene expression system, such as the 
pET System (Novagen), for cloning and expression of recombinant proteins in E. coli was 
selected. Also, a DNA sequence encoding a peptide tag, the His-Tap, was fused to the 3 f end 
of DNA sequences of interest to facilitate purification of the recombinant protein products. 

10 The 3 f end was selected for fusion to avoid alteration of any 5 1 terminal signal sequence. 

Nucleic acids chosen, for example, from the nucleic acids set forth in SEQ ID NOS: 
1, 3 and 5-12 for cloning HBM were prepared by polymerase chain reaction (PGR). 
Synthetic oligonucleotide primers specific for the 5' and 3' ends of the HBM nucleotide 
sequence were designed and purchased from Life Technologies (Gaithersburg, MD). All 

15 forward primers (specific for the 5 ! end of the sequence) were designed to include an Ncol 
cloning site at the 5 ! terminus. These primers were designed to permit initiation of protein 
translation at the methionine residue encoded within the Ncol site followed by a valine 
residue and the protein encoded by the HBM DNA sequence. All reverse primers (specific 
for the 3' end of the sequence) included an EcoRI site at the 5 1 terminus to permit cloning of 

20 - the HBM sequence into the reading frame of the pET-2Sb. The pET-2 8b vector provided a 
sequence encoding an additional 20 carboxyl-terminal amino acids including six histidine 
residues (at the C-tenninus), which comprised the histidine affinity tag. 
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genomic DJNA prepared trorn toe HBM gene was used as the source ot template jjjn A 
for PGR amplification (Ausubel et al, Current Protocols in Molecular Biology, John Wiley 
& Sons (1994)). To amplify a DNA sequence containing the HBM nucleotide sequence, 
genomic DNA (50 ng) was introduced into a reaction vial containing 2 mM MgCl^ 1 |iM 
5 synthetic oligonucleotide primers (forward and reverse primers) complementary to and 
flanking a defined HBM, 0.2 mM of each of deoxynucleotide triphosphate, dATP, dGTP, 
dCTP, dTTP and 2.5 units of heat stable DNA polymerase (Amplitaq, Roche Molecular 
Systems, Inc., Branchburg, NJ) in a final volume of 100 jil. 

Upon completion of thermal cycling reactions, each sample of amplified DNA was 
10 purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg, MD). All 

amplified DNA samples were subjected to digestion with the restriction endonucleases, e.g., 
Ncol and EcoRI (New England BioLabs, Beverly, MA) (Ausubel et ah, Current Protocols in 
Molecular Biology, John Wiley & Sons, Inc. (1994)). DNA samples were then subjected to 
electrophoresis on 1.0% NuSeive (FMC BioProducts, Rockland, ME) agarose gels. DNA 
15 was visualized by exposure to ethidium bromide and long wave UV irradiation. DNA 

contained in slices isolated from the agarose gel was purified using the Bio 101 GeneClean 
Kit protocol (Bio 101, Vista, CA). 

The pET-28b vector was prepared for cloning by digestion with restriction 
endonucleases, e.g., Ncol and EcoRI (New England BioLabs, Beverly, MA) (Ausubel et ah, 
20 Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994)). The pET-28a 
vector, which encodes the histidine affinity tag that can be fused to the 5' end of an inserted 
gene, was prepared by digestion with appropriate restriction endonucleases. 

Following digestion, DNA inserts were cloned (Ausubel et al, Current Protocols in 
Molecular Biology, John Wiley & Sons, Inc. (1994)) into the previously digested pET-28b 
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session vector. Products of the ligation reaction we then used ,0 tarim the BUI 
strainof * »» (Ausubel e,ai, Curr^ Protocol in Molecular Biology, John Wiley & Sons, 

Inc (1994)) as described below. 

Competent bacteria, E. coli strain BL21 or* rf^BUl (DE3),w=re 
5 formed wit recombinant pET expression plasmids carrying ft. cioned HBM sequence 

Wiley & Sons, Inc. (1994)). Briefly, 1 ul of ligation reaction was mixed whh 50 ul of 
dectrocompeten, cells and subjected to a high voltage pulse, after which samples were 
Abated in 0.45 ml SOC medium (0.5% yeast exfract, 2.0% fryptone, 10 mM NaCI, 2.5 mM 
W KC1, 10 mM MgCl, 10 mM MgS0 4 and 20 mM glucose) at 37°C wim shaldng for 1 hour. 
Samples were men spread on LB agar plates contannng 25 ^ Icanamycin suKate for 

„ , • f-QTOi WP re then Dicked and analyzed to evaluate 
growth overnight. Transformed colonies of BL21 were then picJce 

cloned inserts, as described below. 

m d reverse primers specific for me HBM sequences that were used in me origina! PCR 

/a v»i a *n l Current Protocols in Molecular Biology, 
sequence in the expression vector (Ausubel et al, Current Prow 

John Wiley & Sons, Inc. (1994)). 
20 Individual clones of recombinant pET-28b vectors carrying properly cloned HBM 

nucleotide sequences were picked and incubate, in 5 ml of LB broth plus 25 ug/ml 
fcnamycin suUate overnight T*e following day plasmid WA was isolated and purrfrf 
^g toC iagenplasrmdpurificafionpro^ol(0^genmc.,a^or*,CA). 
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The pET vector can be propagated in any E. coli K-12 strain, e.g., HMS174, HB101, 
JM109, DH5 and the like, for purposes of cloning or plasmid preparation. Hosts for 
expression include E. coli strains containing a chromosomal copy of the gene for T7 RNA. 
polymerase. These hosts were lysogens of bacteriophage DE3, a lambda derivative that 
5 carries the lad gene, the lacUV5 promoter and the gene for T7 RNA polymerase. T7 RNA 
polymerase was induced by addition of isopropyl-p-D-thiogalactoside (EPTG), and the T7 
RNA polymerase transcribes any target plasmid containing a functional T7 promoter, such as 
pET-28b, carrying its gene of interest. Strains include, for example, BL21(DE3) (Studier. er 
al.,Meth. Enzymol, 185:60-89 (1990)). 
10 To express the recombinant HBM sequence, 50 ng of plasmid DNA are isolated as 

described above to transform competent BL21(DE3) bacteria as described above (provided 
by Novagen as part of the pET expression kit). The lacZ gene (P-galactosidase) is expressed 
in the pET-System as described for the HBM recombinant constructions. Transformed cells 
were cultured in SOC medium for 1 hour, and the culture was then plated on LB plates 
15 containing 25 ug/ml kanamycin sulfate. The following day, the bacterial colonies were 
pooled and grown in LB medium containing kanamycin sulfate (25 ug/ml) to an optical 
density at 600 nM of 0.5 to 1 .0 O.D. units, at which point l^mM IPTG was added to the 
culture for 3 hours to induce gene expression of the HBM recombinant DNA constructions. 
After induction of gene expression with IPTG, bacteria were collected by 
20 centrifugation in a Sorvall RC-3B centrifuge at 3500 x g for 15 minutes at 4°C. Pellets were 
resuspended in 50 ml of cold mM Tris-HCL pH 8.0,- 0.1 M NaCl and 0.1 mM EDTA (STE 
buffer). Cells were then centrifuged at 2000 x g for 20 minutes at 4°C. Wet pellets were 
weighed and frozen at -80 °C until ready for protein purification. 
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A variety of methodologies known in the art can be used to purify the isolated 
proteins (Coligan et al 9 Current Protocols in Protein Science, John Wiley & Sons (1995)). 
For example, the frozen cells can be thawed, resuspended in buffer and ruptured by several 
passages through a small volume microfluidizer (Model M-110S, Microfluidics International 
5 Corp., Newton, MA). The resultant homogenate is centrifiiged to yield a clear supernatant 
(crude extract) and, following filtration, the crude extract is fractioned over columns. 
Fractions are monitored by absorbance at OD 2g0 nm and peak fractions may be analyzed by 
SDS-PAGE. 

The concentrations of purified protein preparations are quantified 
10 spectrophotometrically using absorbance coefficients calculated from amino acid content 
(Perkins, Eur. J. Biochem., 157:169-180 (1986)). Protein concentrations are also measured 
by the method of Bradford, Anal Biochem., 72:248-254 (1976) and Lowry et al, J. Biol 
Chern., 193:265-275 (1951) using bovine serum albumin as a standard. 

SDS-polyacrylamide gels of various concentrations were purchased from BioRad 
15 (Hercules, CA), and stained with Coomassie blue. Molecular weight markers may include 
rabbit skeletal muscle myosin (200 kDa), E. coli p-galactosidase (116 kDa), rabbit muscle 
phosphorylase B (97.4 kDa), bovine serum albumin (66.2 kDa), ovalbumin (45 kDa), bovine 
carbonic anyhdrase (31 kDa), soybean trypsin inhibitor (21.5 kDa), egg white lysozyme (14.4 
kDa) and bovine aprotirrifi (6.5 kDa). 
20 Once a sufficient quantity of the desired protein has been obtained, it may be used for 

various purposes. A typical use is the production of antibodies specific for binding. These 
antibodies may be either polyclonal or monoclonal, and may be produced by in vitro or in 
vivo techniques well known in the art. Monoclonal antibodies to epitopes of any of the 
peptides identified and isolated as described can be prepared from murine hybridomas 
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(Kohler, Nature, 256:495 (1975)). In summary, a mouse is inoculated with a few micrograms 
ofHBM protein over a period of two weeks. The mouse is then sacrificed. The cells that 
produce antibodies are then removed from the mouse's spleen. The spleen cells are then 
fused with polyethylene glycol with mouse myeloma cells. The successfully fused cells are 
5 diluted in a microliter plate and growth of the culture is continued. The amount of antibody 
per well is measured by immunoassay methods such as ELISA (Engvall, Meth. EnzymoL, 
70:419 (1980)). Clones producing antibody can be expanded and further propagated to 
produce HBM antibodies. Other suitable techniques involve in vitro exposure of 
lymphocytes to the antigenic polypeptides, or alternatively, to selection of libraries of 
10 antibodies in phage or similar vectors. See Huse et al., Science, 246:1275-1281 (1989). For 
additional information on antibody production see Davis et al, Basic Methods in Molecular 
Biology, Elsevier, NY, Section 21-2 (1989). 

Standard protocols for assessing the influence of an agent {e.g., antibody, HBM 
protein, protein polymorphism or Zmaxl protein or compound) to alter lipid levels in a cell or 
15 the physiological levels in a subject are known. For example, see F.W. HEMMING, LIPID 
' Analysis (Bios Scientific Pub. 1996) and J. M. Ordovas, Lipoprotein Protocols 
(Humana Press Inc. 1997). More specifically, cholesterol and triglyceride analysis can be 
performed using the Olympus AU5000 Cholesterol method. This method of measuring 
cholesterol combines the use of the enzymes with a modification of the peroxidase-pheol-4- 
20 aminoantipyrine system, substituting 2-hydroxy-3,5-dichloroben2ene sulfonic acid (2-OH 3,5 
DCBSA) for the phenolic group for the measurement of total cholesterol in the subject serum. 
The assay is based on a series of coupled enzymatic reactions. Cholesterol esters present in 
serum are hydrolyzed to free cholesterol and fatty acids by cholesterol esterase. The 
cholesterol is in rum oxidized by cholesterol oxidase to cholest-4-en-3-one with the 
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simultaneous production ot Hydrogen peroxidase. The Hydrogen peroxidase reacts with 4- 
arninoantipyrine in the presence of 2-OH-3 3 5-DCBSA to produce a chromophore that absorbs 
at 570 nm. The absorbance of the reaction mixture is measured biochromatically at 570/750 
nm and is proportional to the cholesterol concentration of the sample. 

For serum triglyceride analysis, the Olympus AU5000 triglyceride procedure can also 
be used. Briefly, it is based on a series of coupled enzymatic reactions. Triglycerides in the 
serum are hydrolyzed to free fatty acids and glycerol by lipoprotein lipase. Glycerol is 
phosphorylated enzymatically and then oxidized with glycerol phosphate oxidase. The 
hydrogen peroxidase reacts with the chromogen 4-arrnno-antipyrine in the presence of DCB 
Sulfonic Acid to give a chromophore with absorption which is measured bichromatically at 
520/660 nm. The increase in absorbance of the reaction mixture is proportional to the 
triglyceride concentration of die sample. 

XVII. Methods of Use: Gene Therapy 

hi recent years, significant technological advances have been made in the area of gene 
therapy for both genetic and acquired diseases. (Kay et al., Proc. Natl Acad. ScL USA, 
94:12744-12746 (1997)) Gene therapy can be denned as the deliberate transfer of DNA for 
therapeutic purposes. Improvement in gene transfer methods has allowed for development of 
gene therapy protocols for the treatment of diverse types of diseases. Gene therapy has also 
taken advantage of recent advances in the identification of new therapeutic genes, 
improvement in both viral and nonviral gene delivery systems, better understanding of gene 
regulation, and improvement in cell isolation and transplantation. 

The experiments below identify the HSM gene as a dorninant mutation conferring 
elevatedbone mass and that alters lipid levels. The fact that this mutation is dorninant 



-96- 



WO 01/92891 



PCT7US01/16946 



indicates that expression of the HBM protein causes elevated bone mass and perhaps changes 
in lipid levels. Older individuals carrying the HBM gene, and, therefore expressing the HBM 
protein, do not suffer from osteoporosis. These individuals are equivalent to individuals 
being treated with the HBM protein. These observations are a strong experimental indication 
5 that therapeutic treatment with the HBM protein prevents osteoporosis. The bone mass - 
elevating activity of the HBM gene is termed "HBM function." 

Therefore, according to the present invention, a method is also provided of supplying 
HBM function to mesenchymal stem cells (Onyia et al. 9 J. Bone Miner. Res., 13:20-30 
(1998); Ko et al, Cancer Res., 56:4614-4619 (1996)). Supplying such a function provides 

10 protection against osteoporosis. For regulating lipid levels, HBM function can be supplied to 
liver cells, as well as other cells involved in lipid metabolism and lipid regulation (e.g., 
muscle cells, lesion cells, lipid laiden foam cells and megakaryoblasts). The HBM gene or a 
part of the gene may be introduced into the cell in a vector such that the gene remains 
extrachromosomaL In such a situation, the gene will be expressed by the cell from the 

15 extrachromosomal location. 

Vectors for introduction of genes both for recombination and for extrachromosomal 
maintenance are known in the art, and any suitable vector may be used. Methods for 
introducing DNA into cells such as electroporation, calcium phosphate co-precipitation, and 
viral transduction are known in the art, and the choice of method is within the competence of 

20 one skilled in the art (Robbins, Ed., Gene Therapy Protocols, Human Press, NJ (1997)). 
Cells transformed with the HBM gene can be used as model systems to study osteoporosis 
and drug treatments that promote bone growth as well as to study lipid-mediated diseases. 

As generally discussed above, the HBM gene or fragment, where applicable, may be 
used in gene therapy methods in order to increase the amount of the expression products of 
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such genes in mesenchymal stem cells or in other cells. It may be useful also to increase the . 
level of expression of a given HBM protein, or a fragment thereof, even in those cells hi 
which the wild type gene is expressed normally. Gene therapy would be carried out 
' according to generally accepted methods as described by, for example, Friedman, Therapy for 

5 Genetic Mseases.VTiah^ (1991) ' 

A virus or plasmid vector containing a copy of the HBM gene linked to expression 
control elements and capable of replicating inside mesenchymal stem cells or liver cells, is 
prepared. Suitable vectors are known and described, for example, in U.S . Patent No. 
5,252,479 and WO 93/07282, the disclosures of which are incorporated by reference herein in 
10 their entirety. The vector is then injected into the patient, either locally into the bone marrow 
or liver, or systemically (in order to reach any mesenchymal stem cells located at other sites, 
Le, in the blood). If the transfected gene is not permanently incorporated into the genome of 
each of the targeted cells, the treatment may have to be repeated periodically. 
• Gene transfer systerr* known ^ 
15 therapy methods ofthe present invention. These include viral and non-viral transfer methods. 
A number of viruses have been used as gene transfer vectors, including polyoma, La, SV40 
(Madzak et al, J. Gen. Virol, 73:1533-1536 (1992)), adenovirus (Berkner, Cwr. Top. 
Microbiol. Immunol, 158:39-61 (1992); Berkner et al, Bio Techniques, 6:616-629 (1988); 
Gorziglia et al, J. Virol, 66:4407-4412 (1992); Quantin et al, Proc. Natl Acad. Sci. USA, 
20 89:2581-2584 (1992); Rosenfeld et al, Cell, 68:143-155 (1992); Wilkinson et al, Nucl 
Acids Res., 20:2233-2239 (1992); Stratford-Penicaudet et al, Hum. Gene Ther., 1:241-256 
(1990)), vaccinia virus" (Mackett et al, Biotechnology, 24:495-499 (1992)), adeno-associated 
virus (Muzyczka, Curr. Top. Microbiol. Immunol, 158:91-123 (1992); Ohi et al, Gene, . 
89:279-282 (1990)), herpes viruses including HSV andEBV (Margolskee, Curr. Top. 
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Microbiol Immunol., 158:67-90 Johnson et al, J. yirol., 66:2y52-2y65 (iyy2); Fink 

etai, Hum. Gene Ther., 3:11-19 (1992); Breakfield etal,Mol Neurobiol., 1:337-371 
(1987;) Fresse et al, Biochem. Pharmacol, 40:2189-2199 (1990)), and retroviruses of avian 
(Brandyopadhyay et al, Mol Cell Biol, 4:749-754 (1984); Petropouplos et al, J. Virol, 

5 66:3391-3397 (1992)), murine (Miller, Curr. Top. Microbiol Immunol, 158:1-24 (1992); 
Miller et al, Mol Cell Biol, 5:431-437 (1985); Sorge et al, Mol. Cell Biol, 4:1730-1737 
(1984); Mann et al, J. Virol, 54:401-407 (1985)), and human origin (Page et al, J. Virol, 
64:5370-5276 (1990); Buchschalcher al, J. Virol, 66:2731-2739 (1992)). Most human 
gene therapy protocols have been based on disabled murine retroviruses. 

10 Non-viral gene transfer methods known in the art include chemical techniques such as 

calcium phosphate coprecipitation (Graham et al, Virology, 52:456-467 (1973); Pellicer et 
al, Science, 209:1414-1422 (1980)), mechanical techniques, for example microinjection 
(Anderson et al, Proc. Natl Acad. Sci. USA, 77:5399-5403 (1980); Gordon et al, Proc. Natl 
Acad. Sci. USA, 77:7380-7384 (1980); Brinster et al, Cell, 27:223-23 1 (1981); Constantini et 

15 al , Nature, 294:92-94 (1981)), membrane fusion-mediated transfer via liposomes (Feigner et 
al, Proc. Natl. Acad. Sci. USA, 84:7413-7417 (1987); Wang et al, Biochemistry, 28:9508- 
9514 (1989); Kaneda et al, J, Biol Chem., 264:12126-12129 (1989); Stewart et al.,Hum. 
Gene Ther., 3:267-275 (1992); Nabel et al, Science, 249:1285-1288 (1'990); Lim et al, 
Circulation, 83:2007-201 1 (1992)), and direct DNA uptake and receptor-mediated DNA 

20 transfer (Wolff et al, Science, 247:1465-1468 (1990); Wu et al.,BioTechniques, 11:474-485 
(1991); Zenke et al,Proc. Natl. Acad Sci. USA, 87:3655-3659 (1990); Wu et al.,J. Biol 
Chem., 264:16985-16987 (1989); Wolff era/., BioTechniques, 11:474-485 (1991); Wagner 
et al, 1990; Wagner et al, Proc. Natl Acad. Set USA, 88:4255-4259 (1991); Cotten et al, 
Proc. Natl Acad. Sci USA, 87:4033-4037 (1990); Curiel et al, Proc. Natl Acad. Sci. USA, 
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88:8850-8854 (1991); Curiel et al, Hum. Gene Ther., 3:147-154 (1991)). Viral-mediated 
gene transfer can be combined with direct in vivo vectors to the mesenchymal stem cells and 
not into the surrounding ceUs (Romano et al, In Vivo, 12(1):59.67 (1998); Gonez etal, 
Hum. Mol Genetics, 7(12):1913-9 (1998)). Alternatively, the retroviral vector producer cell 
5 line can be injected into the bone marrow (Culver et al, Science, 256: 1550-1552 (1992)). 
Injection of producer cells would then provide a continuous source of vector particles. This 
technique has been approved for use in humans with inoperable brain tumors. 

In an approach which combines biological and physical gene transfer methods, 
plasmid DNA of any size is combined with a polylysine-conjugated antibody specific to the 
10 adenovirus hexon protein, and the resulting complex is bound to an adenovirus vector. The 
trimolecular complex is then used to infect cells. The adenovirus vector permits efficient 
binding, internalization, and degradation of the endosome before the coupled DNA is 
damaged. 

Liposome/DNA complexes have been shown to be capable of mediating direct in vivo 
15 gene transfer. While in standard liposome preparations the gene transfer process is non- 
specific, localized in vivo uptake and expression have been reported in rumor deposits, for 
example, following direct in situ administration (NabeL Hum.. Gene Ther., 3:399-410 (1992)).. 

XVm. Methods of Use: Transformed Hosts, Development of Pharmaceuticals and 
Research Tools 



20 



Cells arid animals that carry the HBMgene can be used as model systems to study and 
test for substances that have potential as therapeutic agents (Onyia et al, J. Bone Miner. Res., 
13:20-30 (1998); Broder et al., Bone, 21:225-235 (1997)). The cells are typically cultured 
mesenchymal stem cells or liver cells. These may be isolated from individuals with somatic 
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or germline HBM genes. Alternatively, the cell line can be engineered to carry the HBM 
gene, as described above. After a test substance is applied to the cells, the transformed 
phenotype of the cell is determined. Any trait of transformed cells can be assessed, including 
formation of bone matrix in culture (Broder et aL, Bone, 21:225-235 (1997)), mechanical 

5 properties (Kizer et aL, Proc. Natl Acad. Scu USA, 94:1013-1018 (1997)), and response to 
application of putative therapeutic agents. 

Animals for testing therapeutic agents can be selected after treatment of ger mlin e cells 
or zygotes. Such treatments include insertion of the Zmaxl gene, as well as insertion of the 
HBM gene and disrupted homologous genes. Alternatively, the inserted Zmaxl gene(s) 

10 and/or i2BMgene(s) of the animals may be disrupted by insertion or deletion mutation of 
other genetic alterations using conventional techniques, such as those described by, for 
example, Capechi, Science, 244:1288 (1989); Valancuis et aL, Mol Cell Biol, 1 1: 1402 
(1991); Hasty et at, Nature, 350:243 (1991); Shinkai et aL, Cell, 68:855 (1992); Mombaerts 
et ah, Cell, 68:869 (1992); Philpott et aL, Science, 256:1448 (1992); Snouwaert et al. 9 

15 Science, 257:1083 (1992); Donehower et aL, Nature, 356:215 (1992). After test substances 
have been administered to the animals, the growth of bone or modulation of lipids must be 
assessed. If the test substance enhances the growth of bone or regulates lipid levels, then the 
test substance is a candidate therapeutic agent These animal models provide an extremely 
important vehicle for potential therapeutic products. Preferred models for studying lipid 

20 modulation include mice (Smith et al t J. Intern, Med., 242: 99-109 (1997)) and guinea pigs. 

Individuals carrying the HBM gene have elevated bone mass and altered lipid levels 
as discussed in the example below. The HBM gene causes this phenotype by altering the 
activities, levels, expression patterns, and modification states of other molecules involved in 
bone development. Using a variety of established techniques, it is possible to identify 
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molecules, preferably proteins or mRNAs, whose activities, levels, expression patterns, and 
modification states are different between systems containing the Zmax 1 gene and systems 
containing the HBM gene. Such systems can be, for example, cell-free extracts, cells, tissues 
or living organisms, such as mice or humans. For a mutant form of Zmax 1, a complete 
5 deletion of Zmaxl , mutations lacking the extracellular or intracellular portion of the protein, 
or any other mutation in the Zmaxl gene may be used. It is also possible to use expression of 
antisense Zmaxl RNA or oligonucleotides to inhibit production of the Zmaxl protein. For a 
mutant fonn of HBM, a complete deletion of HBM, mutations lacking the extracellular or 
intracellular portion of the HBM protein, or any other mutation in the i25Mgene may be 
10 used. It is also possible to use expression of antisense HBM RNA or oligonucleotides to 
• inhibit production of the HBM protein. 

Molecules identified by comparison of Zmaxl systems and HBM systems can be used 
as surrogate markers in pharmaceutical development or in diagnosis of human or animal bone 
disease. Alternatively, such molecules may be used in treatment of bone disease. See, 
15 Schena et aL, Science, 270:467-470 (1995). 

For example, a transgenic mouse carrying the HBM gene in the mouse homologue is 
constructed. A mouse of the genotype HBM/+ is viable, healthy and has elevated bone mass. 
To identify surrogate markers for elevated bone mass, HBM-/+ (i.e., heterozygous) and 
isogenic +/+ (i.e., wild-type) mice are sacrificed. Bone tissue mRNA is extracted from each 
20 animal, and a "gene chip" corresponding to mRNAs expressed in the +/+ individual is 
constructed. mRNA from different tissues is isolated from animals of each genotype, 
reverse-transcribed, fluorescently labeled, and then hybridized to gene fragments affixed to a 
solid support. The ratio of fluorescent intensity between the two populations is indicative of 
the relative abundance of the specific mRNAs in the 4-/+ and HBM/+ animals. Genes 
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encoding mRNAs over- and under-expressed relative to the wild-type control are candidates 
for genes coordinately regulated by the HBM gene. This strategy can be similarly used to 
study lipid regulation. 

Mice also serve as the most common experimental animal model for atherosclerosis 
5 research. There are at least three ways of inducing atherosclerosis in mice: (1) diet induced, 
apoE deficiency-induced and LDL receptor-deficiency induced. The methods for using a 
mouse model for testing agents which modulate lipid levels in vivo can be performed as 
described in Smith et al , J, Intern. Med. 242: 99- 1 09 (1997). 

One standard procedure for identification of new proteins that are part of the same 

10 signaling cascade as an already-discovered protein is as follows. Cells are treated with 

radioactive phosphorous, and the already-discovered protein is manipulated to be more or less 
active. The phosphorylation state of other proteins in the cell is then monitored by 
polyacrylamide gel electrophoresis and autoradiography, or similar techniques. Levels of 
activity of the known protein may be manipulated by many methods, including, for example, 

15 comparing wild-type mutant proteins using specific inhibitors such as drugs or antibodies, 
simply adding or not adding a known extracellular protein, or using antisense inhibition of 
the expression of the known protein (Tamura et al, Science, 280(5369): 1614-7 (1998); 
Meng, EMBOX, 1 7(1 5):439 1-403 (1998); Cooper et al, Cell, 1:263-73 (1982)). 

In another example, proteins with different levels of phosphorylation are identified in 

20 TE85 osteosarcoma cells expressing either a sense or antisense cDNA for Zmaxl. TE85 cells 
normally express high levels of Zmaxl (Dong et al, Biochem. & Biophys. Res. Comm., 
251 :784-790 (1998)). Cells containing the sense construct express even higher levels of 
Zmaxl, while cells expressing the antisense construct express lower levels. Cells are grown 
in the presence of 32 P, harvested, lysed, and the lysates run on SDS polyacrylamide gels to 
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separate proteins, and the gels subjected to autoradiography (Ausubel et ah, Current 
Protocols in Molecular Biology;, John Wiley & Sons (1997)). Bands that differ in intensity 
between the sense and antisense cell lines represent phosphoproteins whose phosphorylation 
state or absolute level varies in response to levels of Zmaxl . As an alternative to the 32 P- 
5 labeling, unlabeled proteins may be separated by SDS-PAGE and subjected to 

immunoblotting, using the commercially available anti-phosphotyrosine antibody as a probe 
(Thomas et al, Nature, 376(6537):267-71 (1995)). As an alternative to the expression of 
antisense RNA, transfection with chemically modified antisense oligonucleotides can be used 
(Woolf et al 9 Nucleic Acids Res,, 18(7):1763-9 (1990)). 

10 Many bone disorders, such as osteoporosis, have a slow onset and a slow response to 

treatment. It is therefore useful to develop surrogate markers for bone development and 
mineralization. Such markers can be useful in developing treatments for bone disorders, and 
for diagnosing patients who may be at risk for later development of bone disorders. 
Examples of preferred markers are N- and C-tenninal telopeptide markers described, for 

15 example, in U.S. Patent Nos. 5,455,179, 5,641,837 and 5,652,112, the disclosures of which 
axe incorporated by reference herein in their entirety. In the area of HIV disease, CD4 counts 
and viral load are useful surrogate markers for disease progression (Vlahov et ah, JAMA, 
279(l):35-40 (1998)). There is a need for analogous surrogate markers in the area of bone 
disease. 

20 A surrogate marker can be any characteristic that is easily tested and relatively 

insensitive to non-specific influences. For example, a surrogate marker can be a molecule 
such as a protein or mRNA in a tissue or in blood serum. Alternatively, a surrogate marker 
may be a diagnostic sign such as sensitivity to pain, a reflex response or the like. 



-104- 



WO 01/92891 



PCT/USO 1/16946 



In yet another example, surrogate markers for elevated bone mass are identified using 
a pedigree of humans carrying the HBM gene. Blood samples are withdrawn from three 
individuals that carry the HBM gene, and from three closely related individuals that do not. 
Proteins in the serum from these individuals are electrophoresed on a two dimensional gel 

5 system, in which one dimension separates proteins by size, and another dimension separates 
proteins by isoelectric point (Epstein et al, Electrophoresis, 17(1 1):1655-70 (1996)). Spots 
corresponding to proteins are identified. A few spots are expected to be present in different 
amounts or in slightly different positions for the HBM individuals compared to their normal 
relatives. These spots correspond to proteins that are candidate surrogate markers. The 

10 identities of the proteins are detennined by microsequencing, and antibodies to the proteins 
can be produced by standard methods for use in diagnostic testing procedures. Diagnostic 
assays for HBM proteins or other candidate surrogate markers include using antibodies 
described in this invention and .a reporter molecule to detect HBM in human body fluids, 
membranes, bones, cells, tissues or extracts thereof. The antibodies can be labeled by joining 

15 them covalently or noncovalently with a substance that provides a detectable signal. In many 
scientific and patent literature, a variety of reporter molecules or labels are described 
including radionuclides, enzymes, fluorescent, chemi-lummescent or chromogenic agents 
(U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; and 
4,366,241). 

20 Using these antibodies, the levels of candidate surrogate markers are measured in 

normal individuals and in patients suffering from a bone disorder, such as osteoporosis, 
osteoporosis pseudoglioma, Engelmann's disease, Ribbing's disease, hyperphosphatasemia, 
VanBuchem's disease, melorheostosis, osteopetrosis, pychodysostosis, sclerosteosis, 
osteopoikilosis, acromegaly, Paget's disease, fibrous dysplasia, tubular stenosis, osteogenesis 
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imperfecta, hypoparathyroidism, pseudohypoparathyroidism, 
pseudopseudohypoparathyroidism, primary and secondary hyperparathyroidism and 
associated syndromes, hypercalciuria, medullary carcinoma of the thyroid gland, 
osteomalacia and other diseases including lipid-mediated diseases. Techniques for measuring 
5 levels of protein in serum in a clinical setting using antibodies are well established. A protein 
that is consistently present in higher or lower levels in individuals carrying a particular 
disease or type of disease is a useful surrogate marker. 

A surrogate marker can be used in diagnosis of a bone disorder. For example, 
consider a child that present to a physician with a high frequency of bone fracture. The 
10 underlying cause may be child abuse, inappropriate behavior by the child, or a bone disorder. 
To rapidly test for a bone disorder, the levels of the surrogate marker protein are measured 
using the antibody described above. 

Levels of modification states of surrogate markers can be measured as indicators of 
the likely effectiveness of a drug that is being developed. It is especially convenient to use 
15 surrogate markers in creating treatments for bone disorders, because alterations in bone 

development or mineralization may require a long time to be observed. For example, a set of 
bone mRNAs, termed the "HBM-inducible rnKNA set" is found to be overexpressed in 
HBM/+ mice as compared to +/+ mice, as described above. Expression of this set can be 
used as a surrogate marker. Specifically, if treatment of +/+ mice with a compound results in 
20 overexpression of the HBM-inducible mKNA set, then that compound is considered a 
promising candidate for further development 

This invention is particularly useful for screening compounds by using the Zmaxl or 
HBM protein or binding fragment thereof in any of a variety of drug screening techniques. 
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The Zmaxl or HBM protein or fragment employed in such a test may either be free in 
solution, affixed to a solid support, or bome on a cell surface. One method of drug screening 
utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant 
nucleic acids expressing the protein or fragment, preferably in competitive binding assays. 
5 Such cells, either in viable or fixed form, can be used for standard binding assays. One may 
measure, for example, for the formation of complexes between a Zmaxl or HBM protein or 
fragment and the agent being tested, or examine the degree to which the formation of a 
complex between a Zmaxl or HBM protein or fragment and a known ligand is interfered with 
by the agent being tested. 
10 Thus, the present invention provides methods of screening for drugs comprising 

contacting such an agent with a Zmaxl or HBM protein, or fragment thereof, and assaying (i) 
for the presence of a complex between the agent and the Zmaxl or HBM protein or fragment, 
or (ii) for the presence of a complex between the Zmaxl or HBM protein or fragment and a 
ligand, by methods well known in the art. In such competitive binding assays the Zmaxl or 
15 HBM protein or fragment is typically labeled. Free Zmaxl or HBM protein or fragment is 
separated from that present in a proteimprotein complex, and the amount of free (i.e., 
uncomplexed) label is a measure of the binding of the agent.being tested to Zmaxl or HBM 
or its interference with Zmaxl or HBM: ligand binding, respectively. 

Another technique for drug screening provides high throughput screening for 
20 ' compounds having suitable binding affinity to the Zmaxl or HBM proteins and is described 
in detail in WO 84/03564. Briefly stated, large numbers of different small peptide test 
compounds are synthesized on a solid substrate, such as plastic pins or some other surface. 
The peptide test compounds are reacted with Zmaxl or HBM proteins and washed. Bound 
Zmaxl or HBM protein is then detected by methods well known in the art. Purified Zmaxl 
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or HBM can be coated directly onto plates for use in the aforementioned drug screening 
techniques. However, non-neutralizing antibodies to the protein can be used to capture 
antibodies to immobilize the Zmaxl or HBM protein on the solid phase. 

This invention also contemplates the use of competitive drug screening assays in 

5 which neutralizing antibodies capable of specifically binding the Zmaxl or HBM protein 
compete with a test compound for binding to the Zmaxl or HBM protein or fragments 
thereof. In this manner, the antibodies can be used to detect the presence of any peptide that 
shares one or more antigenic determinants of the Zmaxl or HBM protein. 

A further technique for drug screening involves the use of host eukaryotic cell lines or 

10 cells (such as described above) that have a nonfunctional Zmaxl or HBM gene. These host 
cell lines or cells are defective at the Zmaxl or HBM protein level. The host cell lines or 
cells are grown in the presence ofdrug compound. The rate of growth of the host cells or 
impact on lipid metabolism is measured to determine if the compound is capable of 
regulating the growth or lipid metabolism of Zmaxl or HBM defective cells. 

15 The goal of rational drug design is to produce structural analogs of biologically active 

proteins of interest or of small molecules with which they interact (e.g., agonists, antagonists, 
inhibitors) in order to fashion drugs which are, for example,.more active or stable forms of 
the protein, or which, e.g., enhance or interfere with the function of a protein in vivo. See, 
e.g., Hodgson, Bio/Technology, 9:19-21 (1991). In one approach, one first determines the 

20 three-dimensional structure of a protein of interest (e.g., Zmax 1 or HBM protein) or, for 
example, of the Zmaxl- or HBM-receptor or ligand complex, by X-ray crystallography, by 
computer modeling or most typically, by a combination of approaches. Less often, useful 
information regarding the structure of a protein may be gained by modeling based on the 
structure of homologous proteins. An example of rational drug design is the development of 
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HIV protease inhibitors (Erickson et al, Science, 249:527-533 (1990)). In addition, peptides 
(e.g., Zmaxl or HBM protein) are analyzed by an alanine scan (Wells, Methods in EnzymoL, 
202:390-41 1 (1991)). In this technique, an amino acid residue is replaced by Ala, and its 
effect on the peptide's activity is determined. Each of the amino acid residues of the peptide 

5 is analyzed in this manner to determine the important regions of the peptide. 

It is also possible to isolate a target-specific antibody, selected by a functional assay, 
and then to solve its crystal structure. In principle, this approach yields a pharmacore upon 
which subsequent drug design can be based. It is possible to bypass protein crystallography 
altogether by generating anti-idiotypic antibodies (anti-ids) to a functional, pharmacologically 

10 active antibody. As a rnirror image of a mirror image, the binding site of the anti-ids would 
be expected to be an analog of the original receptor. The anti-id could then be used to 
identify and isolate peptides from banks of chemically or biologically produced banks of 
peptides. Selected peptides would then act as the pharmacore. 

Thus, one may design drugs which have, e.g., improved Zmaxl or HBM protein 

15 activity or stability or which act as inhibitors, agonists, antagonists, etc. of Zmaxl or HBM 
protein activity. By virtue of the availability of cloned Zmaxl or HBM sequences, sufficient 
amounts of the Zmaxl or HBM protein may be made available to perform such analytical 
studies as X-ray crystallography. In addition, the knowledge of the Zmaxl or HBM protein 
sequence provided herein will guide those employing computer modeling techniques in place 

20 of, or in addition to x-ray crystallography. 

XIX. Methods of Use: Avian and Mammalian Animal Husbandry 

The Zmaxl DNA and Zmaxl protein and/or the HBM DNA and HBM protein can be 
used for vertebrate and preferably human therapeutic agents and for avian and mammalian 
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veterinary agents, including for livestock breeding. Animals contemplated as subjects 
include livestock (e.g., cattle, pigs, sheep, goats, horses, buffalo, etc.), primates, canines, 
felines, rodents, birds, as well as reptiles, fish, and amphibians. Birds, including, for 
example, chickens, roosters, hens, turkeys, ostriches, ducks, pheasants and quails, can benefit 

5 from the identification of the gene and pathway for high bone mass. In many examples cited 
inliterature (for example, McCoy * al, Res. Vet. Sci., 60(2):185-186 (1996)), weakened 
bones due to husbandry conditions cause cage layer fatigue, osteoporosis and high mortality 
- rates. Additional therapeutic agents to treat osteoporosis or other bone disorders in birds can 
have considerable beneficial effects on avian welfare and the economic conditions of the 

10 • livestock industry, including, for example, meat and egg production. 

XX. Methods of use: Diagnostic assays using Zmaxl-specific oligonucleotides for 
detection of genetic alterations affecting bone development and lipid regulation. 

In cases where an alteration or disease of bone development or lipid metabolism is 
suspected to involve an alteration of the Zmaxl gene or the HBM gene, specific 
15 oligonucleotides may be constructed and used to assess the level of Zmaxl mRNA or HBM 
mRNA, respectively, inbone tissue or in another tissue that.affects bone development. 

ForexamplMotestwhemerape 
andhpidregulatio^polymerasechainreactioncanbeused. Two oligonucleotides are 

synthesized by standard methods or are obtained from a commercial supplier of custom-made 
' 20 ongonucleotide,Thelengfc^^ 

the Oligo 4.0 primer Picking program (Wojchich Rychlik, 1992). One of the 
oligonucleotides is designed so that it will hybridize only to HBM DNA under the PGR. 
conditions used. The other oligonucleotide is designed to hybridize a segment of Zmaxl 
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genomic DNA, such that amplification of DNA using these oligonucleotide primers produces 
a conveniently identified DNA fragment. For example, the pair of primers 
CCAAGTTCTGAGAAGTCC (SEQ ID NO:32) and AATACCTGAAACCA TACCTG 
(SEQ ID NO:33) will amplify a 530 base pair DNA fragment from a DNA sample when the 
5 following conditions are used: step 1 at 95°C for 120 seconds; step 2 at 95°C for 30 seconds; 
step 3 at 5S°C for 30 seconds; step 4 at 72°C for 120 seconds; where steps 2-4 are repeated " 
■:. 35 times. Tissue samples may be obtained from hair follicles, whole blood, or the buccal 
cavity. 

The fragment generated by the above procedure is sequenced by standard techniques. 
10 Individuals heterozygous for the HBM gene will show an equal amount of G and T at the 
second position in the codon for glycine 171. Normal or homozygous wild-type individuals 
will show only G at this position. 

Other amplification techniques besides PCR may be used as alternatives, such as 
ligation-mediated PCR or techniques involving Q-beta replicase (Cahill et ah, Clin. Cliem., 
15 37(9): 1482-5 (1991)). For example, the oligonucleotides AGCTGCTCGTAGCTGTCTCT 
CCCTGGATCACGGGTACATGTACTGGACAGACTGGGT (SEQIDNO:34) and 
TGAGACGCCCCGGATTGAGCGGGCAGGGATAGCTTATTCCCTGTGCCGCATTACG 

GC (SEQ ID NO:35) can be hybridized to a denatured human DNA sample, treated with a 
DNA ligase, and then subjected to PCR amplification using the primer oligonucleotides 
20 AGCTGCTCGTAG CTGTCTCTCCCTGGA (SEQ ID NO:36) and 

GCCGTAATGCGGCACAGGGAATAAGCT (SEQ ID NO:37). In the first two 
oligonucleotides, the outer 27 bases are random sequence corresponding to primer binding 
sites, and the inner 30 bases correspond to sequences in the Zmaxl gene. The T at the end of 
the first oligonucleotide corresponds to the HBM gene. The first two oligonucleotides are 



-111- 



BNSDOC1D: <WO 0192B91A2J_> 



WO 01/92891 



PCT/US01/16946 



ligated only when hybridized to human DNA carrying the HBM gene, which results in the 
formation of an ampliflable 1 14 bp DNA fragment. 

Products of amplification can be detected by agarose gel electrophoresis, quantitative 
hybridization, or equivalent techniques for nucleic acid detection known to one skilled in the 
5 art of molecular biology (Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratory, Cold Spring, NY (1989)). 

Other alterations in the Zmaxl gene or the HBM gene may be diagnosed by the same 
type of amplification-detection procedures, by using oligonucleotides designed to identify 
those alterations. These procedures can be used in animals as well as humans to identify 
10 alterations in Zmaxl or HBM that affect bone development and/or lipid metabolism or levels. 
Expression of Zmaxl or HBM in bone tissue may be accomplished by fusing the 
cDNA of Zmaxl or HBM, respectively, to a bone-specific promoter in the context of a vector 
for genetically engineering vertebrate cells. DNA constructs are introduced into cells by 
packaging the DNA into virus capsids, by the use of cationic liposomes, electroporation, or 
15 by calcium phosphate transfection. Transfected cells, preferably osteoblasts, may be studied 
in culture or may be introduced into bone tissue in animals by direct injection into bone or by 
intravenous injection of osteoblasts, followed by incorporation into bone tissue (Ko et al, 
Cancer Research, 56(20):4614-9 (1996)). For example, the osteocalcin promoter, which is 
specifically active in osteoblasts, may be used to direct transcription of the Zmaxl gene or the 
20 HBM gene. Any of several vectors and transfection methods may be used, such as retroviral 
vectors, adenovirus vectors, or vectors that are maintained after transfection using cationic 
liposomes, or other methods and vectors described herein. 

Similarly Zmaxl, or HBM can be expressed in liver tissue or in other lipid- 
metabolism or Upid-regulating cells, such as lipid laden foam cells or lesion cells. This can 
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be accomplished by fusing the cDNA of Zmaxl or HBM respectively to, for example, a liver 
specific promoter or other suitable promoter in the context of a vector for genetically 
engineering vertebrate cells. DNA constructs are introduced into cells by packaging the DNA 
into, for example,' virus capsids, by the use of canonic liposomes, electroporation, or calcium 

. 5 phosphate transfection. The transfected cells, preferably liver cells, may be studied in culture 
or can be introduced into animals by direct injection into the liver or other cell involved in 
lipid regulation or metabolism. The vectors and transfection methods to be used are similar 
to those described herein. 

Alteration of the level of functional Zmaxl protein or HBM protein affects the level 

10 of bone mineralization and lipid levels. By manipulating levels of functional Zmaxl protein 
or HBM protein, it is possible to affect bone development and to increase or decrease levels 
of bone mineralization as well as lipid levels. For example, it may be useful to increase bone 
mineralization in patients with osteoporosis. Alternatively, it may be useful to decrease bone 
mineralization in patients with osteopetrosis or Pager's disease. Alteration of Zmaxl levels or 

15 HBM levels can also be used as a research tool. Specifically, it is possible to identify 

proteins, mRNA and other molecules whose level or modification status is altered in response 
to changes in functional levels of Zmaxl or HBM. The pathology and pathogenesis of bone 
disorders is known and described, for example, in Rubin and Farber (Eds.), Pathology, 2nd 
Ed., S.B. Lippincott Co., Philadelphia, PA (1994). 

20 Zmaxl or HBM protein levels can be altered to regulate lipid levels in a cell or a 

subject The pathology and pathogenesis of atherosclerosis and arteriosclerosis is known and 
described, for example, in Edwin L. Bierman, "Atherosclerosis and Other Forms of 
Arteriosclerosis," in Harrison's Principles of Internal Medicine, 1106-1116 (13th Ed., 1994). 
Modulation of lipid levels may be useful to lower certain levels of lipids (e.g., LDL) in 
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patients with arteriosclerosis and/or atherosclerosis, as well as conditions and diseases 
affiliated with atherosclerosis and arteriosclerosis, as described by Bierman (1994). 

A variety of techniques can be used to alter the levels of functional Zmaxl or HBM. 
For example, intravenous or intraosseous injection of the extracellular portion ofZmaxl or 
5 mutations thereof, or HBM or mutations thereof, will alter the level ofZmaxl activityor 
HBM activity, respectively, in the body of the treated human, animal or bird. Truncated 
versions of the Zmaxl protein or HBM protein can also be injected to alter the levels of 
functional Zmaxl protein or HBM protein, respectively. Certain forms ofZmaxl or HBM 
enhance the activity of endogenous protein, while other forms are inhibitory. 
10 In a preferred embodiment, the HBM protein is used to treat osteoporosis or 

arteriosclerosis. In a further preferred embodiment, the extracellular portion of the HBM 
protein is used. This HBM protein may be optionally modified by the addition of a moiety 
that causes the protein to adhere to the surface of cells. The protein is prepared in a 
pharmaceutical^ acceptable solution and is administered by injection or another method that 
15 achieves acceptable pharmacokinetics and distribution. 

In a second embodiment of this method, Zmaxl or HBM levels are increased or 
decreased by gene therapy techniques. To increase ZmaxLor HBM levels, osteoblasts or 
another useful cell type are genetically engineered to express high levels of Zmaxl or HBM 
as described above. Alternatively, to decrease Zmaxl or HBM levels, antisense constructs 
20 that specificaUy reduce U 
general, a tissue-nonspecific promoter may be used, such as the CMV promoter or another 
commercially available promoter found in expression vectors (Wu et al, Toxicol. Appl 
Pharmacol, 141(l):330-9 (1996)). In a preferred embodiment, a Zmaxl cDNAorits 
antisense is transcribed by a bone-specific promoter, such as the osteocalcin or another 
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promoter, to achieve specific expression in bone tissue. In this way, if a Zmaxl -expressing 
DNA construct or HBM-expressing construct is introduced into non-bone tissue, it will not be 
expressed. Similarly, if a liver-specific promoter is used to express the HBM or Zmaxl 
proteins in liver or other cell involved in lipid regulation or metabolism, the DNA construct 

5 with, for example, a liver-specific promoter will not be expressed in other non-liver tissues. 

In a third embodiment of this method, antibodies against Zmaxl or HBM are used to 
inhibit its function. Such antibodies are identified herein. 

In a fourth embodiment of this method, drugs that inhibit Zmaxl function or HBM 
function are used. Such drugs are described herein and optimized according to techniques of 

10 medicinal chemistry well known to one skilled in the art of pharmaceutical development. 

Zmaxl and HBM interact with several proteins, such as ApoE. Molecules that inhibit 
the interaction between Zmaxl or HBM and ApoE or another binding partner are expected to 
alter bone development and mineralization. Such inhibitors may be useful as drugs in the 
treatment of osteoporosis, osteopetrosis, or other diseases of bone mineralization. Such 

15 inhibitors may be low molecular weight compounds, proteins or other types of molecules. 
See, Kim et al.,J. Biochem. (Tokyo), 124(6): 1072-1076 (1998). 

Inhibitors of the interaction between Zmaxl or HBM and interacting proteins may be 
isolated by standard drug-screening techniques. For example, Zmaxl protein, (or a fragment 
thereof) or HBM protein (or a fragment thereof) can be immobilized on a solid support such 

20 as the base of microtiter well. A second protein or protein fragment, such as ApoE is 

derivatized to aid in detection, for example with fluorescein. Iodine, or biotin, then added to 
the Zmaxl or HBM in the presence of candidate compounds that may specifically inhibit this 
protein-protein domain of Zmaxl or HBM, respectively, and thus avoid problems associated 
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with its transmembrane segment. Drug screens of this type are well known to one skilled in 
the art of pharmaceutical development. 

Because Zmaxl and HBM are involved in bone development and lipid regulation, 
proteins that bind to Zmaxl and HBM are also expected to be involved in bone development 
5 and lipid regulation. Such binding proteins can be identified by standard methods, such as 
co-immunoprecipitation, co-fxactionation, or the two-hybrid screen (Ausubel et al, Current 
Protocols in Molecular Biology, John Wiley & Sons (1997)). For example, to identify 
Zmaxl-interacting proteins or HBM-interacting proteins using the two-hybrid system, the 
extracellular domain of Zmaxl or HBM is fused to LexA and expressed for the yeast vector 
10 pEG202 (the "bait") and expressed in the yeast strain EGY48. The yeast strain is transformed 
with a "prey" library in the appropriate vector, which encodes a galactose-inducible 
transcription-activation sequence fused to candidate interacting proteins. The tedhniques for 
initially selecting and subsequently verifying interacting proteins by this method are well 
known to one skilled in the art of molecular biology (Ausubel et al, Current Protocols in 
15 Molecular Biology, John Wiley & Sons (1 997)). 

In a preferred embodiment, proteins that interact with HBM, but not Zmaxl, are 
identified using a variation of the above procedure (Xu et al^Proc. Natl. Acad. Sci. USA, 
94(23): 12473-8 (Nov. 1997))'. This variation of the two-hybrid system uses two baits, and 
Zmaxl and HBM are each fused to LexA and TetR, respectively. Alternatively, proteins that 
20 interact with the HBM but not Zmaxl are also isolated. These procedures are weU known to 
one skilled in the art of molecular biology, and are a simple variation of standard two-hybrid 
procedures. 

As an alternative method of isolating Zmaxl or HBM interacting proteins, a 
biochemical approach is used. The Zmaxl protein or a fragment thereof, such as the 
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extracellular domain, or the HBM protein or a fragment thereof, such as the extracellular 
domain, is chemically coupled to Sepharose beads. The Zmaxl- or HBM-coupled beads are 
poured into a column. An extract of proteins, such as serum proteins, proteins in the 
supernatant of a bone biopsy, or intracellular proteins from gently lysed TE85 osteoblastic 
5 cells, is added to the column. Non-specifically bound proteins are eluted, the column is 

washed several times with a low-salt buffer, and then tightly binding proteins are eluted with 
a high-salt buffer. These.are candidate proteins that bind to Zmaxl or HBM, and can be 
tested for specific binding by standard tests and control experiments. Sepharose beads used 
for coupling proteins and the methods for perfonning the coupling are commercially 
10 available (Sigma), and the procedures described here are well known to one skilled in the art 
of protein biochemistry. 

As a variation of the above procedure, proteins that are eluted by high salt from the 
Zmaxl- or HBM-Sepharose column are then added to an HBM-Zmaxl-sepharose column. 
Proteins that flow through without sticking are proteins that bind to Zmaxl but not to HBM. 
15 Alternatively, proteins that bind to the HBM protein and not to the Zmaxl protein can be 
isolated by reversing the order in which the columns are used. Similar columns can be 
prepared for use in assessing lipid regulation in liver and other tissues and cells involved in 
lipid regulation and or metabolism. 

XXI. Method of Use: Transformation- Associated Recombination (TAR) Cloning 

20 Essential for the identification of novel allelic variants of Zmaxl is the ability to 

examine the sequence of both copies of the gene in an individual. To accomplish this, two 
"hooks," or regions of significant similarity, are identified within the genomic sequence such 
that they flank the portion of DNA that is to be cloned. Most preferably, the first of these 
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hooks is derived from sequences 5 ! to the first exon of interest and the second is derived from 
sequences 3' to the last- exon of interest. These two "hooks" are cloned into a bacterial/yeast 
shuttle vector such as that described by Larionov et al. 9 Proc. Natl Acad. Sci. USA, 94:7384- 
7387 (1997). Other similar vector systems may also be used. To recover the entire genomic 
5 copy of the Zmaxl gene, the plasmid containing the two "hooks" is linearized with a 

restriction endonuclease or is produced by another method such as PGR. This linear DNA 
fragment is introduced into yeast cells along with human genomic DNA. Typically, the yeast 
Saccharomyces cerevisiae is used as a host cell, although chicken host cells can be used as 
well (Larionov et aL, Genet Eng. (NY). 21:37-55 (1999). During and after the process of 
10 transformation, the endogenous host cell converts the linear plasmid to a circle by a 

recombination event whereby the region of the human genomic DNA homologous to the 
"hooks" is inserted into the plasmid. This plasmid can be recovered and analyzed by methods 
well known to one skilled in the art. Obviously, the specificity for this reaction requires the 
host cell machinery to recognize sequences similar to the "hooks" present in the linear 
15 fragment. However, 100% sequence identity is not required, as shown by Kouprrna et aL, 
Genomics, 53(l):21-28 (October 1998), where the author describes using degenerate repeated 
sequences common in the human genome to recover fragments of human DNA from a 
rodent/human hybrid cell line. 

In another example, only one "hook" is required, as described by Larionov et al 9 
20 Proa Natl Acad. Sci. USA, 95(8):4469-74 (April 1998). For this type of experiment, termed 
"radial TAR cloning," the other region of sequence similarity to drive the recombination is 
derived from a repeated sequence from the genome. In this way, regions of DNA adjacent to 
the Zmaxl gene coding region can be recovered and examined for alterations that may affect 
function. 
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XXII. Methods of Use: Genomic Screening 

The use of polymorphic genetic markers linked to the HBM gene or to Zmaxl is very 
useful in predicting susceptibility to osteoporosis or other bone diseases. Polymorphic 
genetic markers linked to the HBM gene or the Zmaxl gene also can be used to predict 
5 susceptibility to arteriosclerosis or atherosclerosis and conditions related thereto. Koller et 
al.,Amer. J. Bone Min. Res., 13:1903-1908 (1998) have demonstrated that the use of 
polymorphic genetic markers is useful for linkage analysis. Similarly, the identification of 
polymorphic genetic markers within the HBM gene will allow the identification of specific 
allelic variants that are in linkage disequilibrium with other genetic lesions that affect bone 
10 development. Using the DNA sequence from the BACs, a dinucleotide CAn repeat was 
identified and two unique PCR primers that will amplify the genomic DNA containing this 
repeat were designed, as shown below: 

B200E21C16_L: GAGAGGCTATATCCCTGGGC (SEQ ID NO:38) 
B200E21C16_R: ACAGCACGTGTTTAAAGGGG (SEQ ID NO:39) 

15 and used in the genetic mapping study. 

This method has been used successfully by others skilled in the art (e.g., Sheffield et 
al., Genet, 4:1837-1844 (1995); LeBlanc-Straceski et al, Genomics, 19:341-9 (1994); Chen 
et al, Genomics, 25:1-8 (1995)). Use of these reagents with populations or individuals will 
predict their risk for osteoporosis. Similarly, single nucleotide polymorphisms (SNPs), such 

20 as those shown in Table 4 above, can be used as well to predict risk for developing bone 
diseases or resistance to osteoporosis in the case of the HBM gene. It is also contemplated 
that single nucleotide polymorphisms (SNPs) such as those described above, may be used to 
predict the risk in a subject for developing arteriosclerosis and atherosclerosis and related 
conditions. 
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XXIII, Methods of Use: Modulators of Tissue Calcification 

The calcification of tissues in the human body is well documented. Towler et al^ J. 
Biol Chem., 273:30427-34 (1998) demonstrated that several proteins known to regulate 
calcification of the developing skull in a model system are expressed in calcified aorta* The 
5 expression of Msx2, a gene transcribed in osteoprogenitor cells, in calcified vascular tissue 
indicates that genes which are important in bone development are involved in calcification of 
other tissues. Treatment with HBM protein, agonists or antagonists is likely to ameliorate 
calcification (such as the vasculature, dentin and bone of the skull visera) due to its 
demonstrated effect on bone mineral density. In experimental .systems where tissue 

10 calcification is demonstrated, the over-expression or repression of Zmaxl activity permits the 
identification of molecules that'are directly regulated by the Zmaxl gene. These genes are 
potential targets for therapeutics aimed at modulating tissue calcification. For example, an 
animal, such as the LDLR -/-, mouse is fed a high fat diet and is observed to demonstrate 
expression of markers of tissue calcification, including Zmaxl. These animals are then 

15 treated with antibodies to Zmaxl or HBM protein, antisense oligonucleotides directed against 
Zmaxl or HBM cDNA, or with compounds known to bind the Zmaxl or HBM protein or its 
binding partner or ligancL RNA or proteins are extracted .from the vascular tissue and the 
relative expression levels of the genes expressed in the tissue are determined by methods well 
known in the art. Genes that are regulated in the tissue are potential therapeutic targets for 

20 pharmaceutical development as modulators of tissue calcification. 

The nucleic acids, proteins, peptides, amino acids, small molecules or other 
pharmaceutically useful compounds of the present invention that are to be given to an 
individual may be administered in the form of a composition with a pharmaceutically 
acceptable carrier, excipient or diluent, which are well known in the art. The individual may 
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be a mammal or a bird, preferably a human, a rat, a mouse or bird. Such compositions may 
be acmiinistered to an individual in a pharmaceutically effective amount. The amount 
administered will vary depending on the condition being treated and the patient being treated. 
The compositions may be administered alone or in combination with other treatments. 

5 XXTV. Pharmaceutical Compositions 

The invention also contemplates pharmaceutical compositions comprising a lipid 
mediating agent which modulates HBM and/or Zmaxl activity in combination with a 
lipoprotein modulating agent (e.g., blofibrate, gemfibrozil, nicotinic acid, cholestyramine, 
cholestipol, lovastatin, simvastatin, pravastain, probucol, premarin or estradiol.). Liprotein 
10 modulating agents can include compounds or compositions which modulate (e.g. , up-regulate 
or down-regulate) LDL, VLDL, HDL or DDL levels. 

The lipid mediating agent, which modulates HBM and/or Zmaxl activity, can include 
proteins, monoclonal antibodies or fragments thereof, chemicals, and mimetics. One 
contemplated pharmaceutical composition can comprise the monoclonal antibody and a 
15 pharmaceutically acceptable carrier. For the purposes of the present invention, a 

"pharmaceutically acceptable carrier" can be any of the standard carriers well known in the 
art For example, suitable carriers can include phosphate buffered saline solutions, emulsions 
such as oil/water emulsions, and various types of wetting agents. Other carriers can also 
include sterile solutions, tablets, coated tablets, and capsules. Typically, such carriers can 
20 also contain excipients such as starch, milk, sugar, types of clay, gelatin, stearic acid, or salts 
thereof, magnesium or calcium sterate, talc, vegetable fats or oils, gums, glycerols, or oilier 
known excipients. Such carriersxan also include flavors and color additives, preservatives, 
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or other ingredients. Compositions comprising such carriers are formulated by well known 
conventional means. See REMINGTON'S PHARMACEUTICAL SCIENCE (1 5th ed. 1980). 

For diagnostic purposes, the antibodies and recombinant binding proteins can be 
either labeled or unlabeled. Typically, diagnostic assays entail detecting the formation of a 
5 complex through the binding of the monoclonal antibody or recombinant binding protein to a 
HBM protein or Zmaxl protein. Wren unlabeled, the antibodies and recombinant binding 
proteins find use in agglutination assays. In addition, unlabeled antibodies can be used in 
combination with outer labeled antibodies (second antibodies) mat are specifically reactive 
with the monoclonal antibody or recombinant binding protein, such as antibodies specific for 
10 immnnoglobulim Alternatively, the monoclonal antibodies and recombinant binding proteins 
can be directly labeled. A wide variety of labels canbe employed, such as radionuclides 
(eg, To, '"In, m I and »'D, fluoresces enzymes, enzyme substrates, enzyme cefaclors, 
Jyme inhibitors, ligands (particularly haptens), *c Numerous types of immunoassays are 
well known in the art 

15 Commonly, me monoclonal antibodies and recombinant binding proteins of the 

present invention are used in fluorescent assays, where the subject antibodies or recombinant 
Wmgproteto are conjugated toafluoresc^^ 

(Frrc). 

' The examples provided below are not meant to limit the invention in any way, hut 
20. serve to provide preferred embodiments for the invention. 

FVAMPLES 

The present invention is described by reference to the following Examples, which are 
offered by way of illustration and are not intended to limit the invention in any manner. 
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Standard techniques well known in the art or the techniques specifically described below 
were utilized. 

Example 1 

The propositus was referred by her physicians to the Creighton Osteoporosis Center 
for evaluation of what appeared to be unusually dense bones. She was 1 8 years old and came 
to medical attention two years previous because of back pain, which was precipitated by an 
auto accident in which the car in which she was riding as a passenger was struck from behind. 
Her only injury was soft tissue injury to her lower back that was manifested by pain and 
muscle tenderness. There was no evidence of fracture or subluxation on radiographs. The 
pain lasted for two years, although she was able to attend school full time. By. the time she 
was seen in the Center, the pain was nearly resolved and she was back to her usual activities 
as a high school student. Physical exam revealed a normal healthy young woman standing 66 
inches and weighing 128 pounds. Radiographs of the entire skeleton revealed dense looking 
bones with thick cortices. All bones of the skeleton were involved. Most importantly, the 
shapes of all the bones were entirely normal. The spinal BMC was 94.48 grams in Ll-4, and 
the spinal BMD was 1.667 gm/cm 2 in Ll-4. BMD was 5.62_standard deviations (SD) above 
peak skeletal mass for women. These were measured by DXA using a Hologic 2000-. Her 
mother was then scanned and a lumbar spinal BMC of 58.05 grams and BMD of 1.500 
gm/cm 2 were found. Her mother's values place her 4.12 SD above peak mass and 4.98 SD 
above her peers. Her mother was 51 years old, stood 65 inches and weighed 140 pounds. 
Her mother was in excellent health with no history of musculoskeletal or other symptoms. 
Her father's lumbar BMC was 75.33 grains and his BMD was 1.118 gm/cm 2 . These values 
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place him 0.25 SD above peak bone mass for males. He was in good health, stood 72 inches 
tall, and weighed 187 pounds. 

These clinical data suggested that the propositus inherited a trait from her mother, 
which resulted in very high bone mass, but an otherwise normal skeleton, and attention was 
5 focused on the maternal kindred. In U.S. Patent No. 5,691,153, twenty- two of these 

members had measurement of bone mass by DXA. In one case, the maternal grandfather of 
thepropositus, was deceased, however, medical records, antemortem skeletal radiographs and 
a gall bladder specimen embedded in paraffin for DNA genotyping were obtained. His 
radiographs showed obvious extreme density of all of the bones available for examination 
10 including the femur and the spine, and he was included among the affected members. In this 
invention, the pedigree has been expanded to include 37 informative individuals. These 
additions are a significant improvement over the original kinship (Johnson et al., Am. J. Hum. 
Genet., 60:1326-1332 (1997)) because, among the fourteen individuals added since the 
original study, two individuals harbor key crossovers. X-linkage is ruled out by the presence 
15 of male-to-male transmission from individual 12 to 14 and 15. 



Example 2 

The present invention describes DNA sequences derived from two BAC clones from 
the HBM gene region, as evident in Table 6 below, which is an assembly of these clones. 
Clone b200e21-h (ATCC No. 980812; SEQ ID NOS: 10-11) was deposited at the American 
20 Type Culture Collection (ATCC), 10801 University Blvd., Manassas, VA 201 10-2209 
U.S.A., on December 30, 1997. Clone b527dl2-h (ATCC No. 980720; SEQ ID NOS: 5-9) 
was deposited at the American Type Culture Collection (ATCC), 10801 University Blvd., 
Manassas, VA 201 10-2209 U.S.A., on October 2, 1998. These sequences are unique reagents 
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that can be used by one skilled in the art to identify DNA probes for the Zmaxl gene, PCR 
primers to amplify the gene, nucleotide polymorphisms in the Zmaxl gene, or regulatory 
elements of the Zmaxl gene. 

TABLE 6 



Contig 


ATCC No. 


SEQ ID NO. 


Length 


b527dl2-h_contig302G 


980720 


5 


3096 


b527dl2-h_contig306G 


980720 


6 


26928 


b527dl2-h_contig307G 


980720 


7 


29430 


b527dl2-h_contig308G 


980720 


8 


33769 


b527dl2-h_contig309G 


980720 


9 


72049 


b200e21-h contigl 


980812 


10 


8705 


b200e21-h_contig4 


980812 


11 


66933 



The disclosure of each of the patents, patent applications and publications cited in the 
specification is hereby incorporated by reference herein in its entirety. 
15 Although the invention has been set forth in detail, one skilled in the art will 

recognize that numerous changes and modifications can bemade, and that such changes and 
modifications may be made without departing from the spirit and scope of the invention. 

Example 3 

Since Zmaxl has similarity to the LDL receptor family of genes, it may be involved 
20 in lipid metabolism. However, others have reported that lipid profile variables did not show 
significant association with bone mass and could not be used as indicators for bone mineral 
density (Zabaglia et al, "An exploratory study of association between lipid profile and bone 
mineral density in menopausal women in a Campinas reference hospital," Cad. Saude 
Publico 14: 779-86 (1998)). Zmaxl may be normally involved in regulating bone density by 
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depositing calcium during bone remodeling. The HBM mutation may result in increased 
deposition thus conferring denser bone structure. Interestingly, atherosclerotic plaques 
contain calcified material and express a variety of genes involved in bone differentiation. 
To test whether the HBM gene was involved in lipid regulation, biochemical tests 

5 were performed to measure serum level of various lipid containing molecules or precursors in 
affected and unaffected HBM family members to test whether the HBM mutation in the 
Zmaxl gene effects lipid metabolism. Table 7 shows the results of testing eight HBM 
individuals and seven unaffected individuals. Wilcoxon rank-sum tests (non-parametric 
equivalent of a T-test) were performed to assess whether levels of biochemical markers from 

10 affected HBM individuals deviated from unaffected individuals. The data obtained were 

analyzed separately by gender, as well as by combining values from males and females, when 
appropriate. 

Standard diagnostic protocols were used to dete rmin e the concentration (mg/dL) with 
triglycerides, cholesterol, high density lipoprotein (HDL), low density lipoprotein (LDL), 

15 very low density lipoprotein (VLDL), apolipoprotein A-l (APO A-l), apolipoprotein B (APO 
B), and lipoprotein a (LIPOa). For such procedures, see for example, F. W. HEMMING, LIPID 
Analysis (Bios Scientific Pub. 1996) and j. M. Ordovas, Lipoprotein Protocols 
(Humana Press Inc., 1997). The genotype for apolipoprotein E (APO E) was also reported. 
There are three common alleles (e.g.,- E2, E3 and E4). The affected and unaffected HBM 

20 family members are heterozygous or homozygous for the alleles. 

The results obtained were statistically significant: . (1) Triglyceride levels were 
generally lower in affected individuals than in unaffected individuals, and (2) very low 
density lipoprotein (VLDL) levels were generally lower in affected individuals than in 
unaffected individuals. Additionally, the following comparisons approached statistical 

25 significance (p=0.06): (1) high density lipoprotein (HDL) levels were higher in affected 
males than in unaffected males, and (2) the ratio of low density lipoprotein (LDL) to high 
density lipoprotein (HDL) was generally higher in affected males than in unaffected males. 

In Table 7, "ARUP" is ARUP Laboratories, 500 Chipeta Way, Salt Lake City, UT 
84108 where one of the studies was performed. "SJH" refers to the second center which 
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performed these studies, Creighton Medical Laboratories, 28th & Burt, Dental-Rm 306, 
Omaha, NE 68178. APO-A1, APO-B and LIPOa are reported inmg.dL. Total serum levels 
also are in mg/dL. 

All cited patents and publications referred to in this application are herein 
5 incorporated by reference in their entirety. 
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CLAIMS 

What is claimed is: 

1 . A method of identifying a molecule involved in lipid regulation comprising 
identifying a molecule that binds to, or that inhibits binding of a molecule to, HBM or 
Zmaxl. 

2. The method of claim 2, wherein said molecule is a protein. 

3 . The method of claim 2, further comprising producing an antibody to the 
protein. 

4. A method for identifying a protein involved in lipid regulation comprising 
identifying a protein that has an expression level that is different in a first host comprising the 
Zmaxl gene when compared to a second host comprising the HBM gene. 

5 . The method of claim 4, wherein the host is an animal. 

6. A method for identification of a candidate molecule involved in lipid 

regulation comprising: 

(A) identifying a molecule that binds to, or that inhibits binding of a molecule to, 
the nucleic acid sequence of SEQ ID NO: 1 or a Zmaxl nucleic acid comprising a 
polymorphism of Table 4; 

(B) identifying a molecule that binds to, or that inhibits binding of a molecule to, 

the nucleic acid sequence of SEQ ID NO: 2; and 

(C) comparing the extent of binding, or the extent of inhibition of binding, of the 
molecule to each nucleic acid sequence, wherein the molecule that binds, or inhibits binding, 
more or less to the nucleic acid sequence of SEQ ID NO: 2 or the nucleic acid sequence of 
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SEQ ID NO: 1 or a Zmaxl nucleic acid comprising a polymorphism of Table 4 is the 
candidate molecule. 

7. The method of claim 6, wherein the candidate molecule is a protein, an mKNA 
or an antisense nucleic acid. 

8. A method for testing a substance as a therapeutic agent for modulating lipid 
levels comprising administering a nucleic acid comprising SEQ ID NO: 2 or a nucleic acid 
sequence with an HBM polymorphism to a subject, and assessing whether lipid levels are 
modulated. 

' 9. The method of claim 8, wherein the subject is an animal and the animal is 
selected from the group consisting of: livestock, primates, humans, canines, felines, rodents, 
birds, reptiles, fish, and amphibians. 

10. A method for testing a substance as a therapeutic agent for modulating lipid 
levels comprising administering a protein comprising SEQ ID NO: 4 or a Zmaxl protein 
comprising a polymorphism of Table 4 to a subject, and assessing whether lipid levels are 
modulated. 

11. . A method of pharmaceutical development for treating lipid-mediated disorders 
comprising identifying a molecule that binds to the amino acid sequence of SEQ ID NO: 4 or 
to a Zmaxl protein comprising a polymorphism of Table 4. 

12. The method of claim 1 1, wherein the molecule inhibits or enhances the 
function of me amino acid. 

13. A method of pharmaceutical development for treatment of lipid-mediated 
disorders comprising: 
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(A) constructing a first host that contains the Zmaxl gene or protein; 

(B) constructing a second host that contains the HBM gene or protein; 

(C) analyzing a difference between the first host and the second host; and 

(D) identifying a molecule that, when added to the first host, causes the first host 
to exhibit a characteristic feature of the second host. 

14. The method of claim 13, wherein the host is a cell-free extract, a cell or an 

animal. 

15. The method of claim 13, wherein the difference is a surrogate marker. 

16. A method of regulating lipid levels in a host comprising administering the 
amino acid sequence comprising SEQ ID NO: 4 to a somatic cell or to a germ-line cell of a 
host suffering from a lipid-mediated disorder. 

17. The method of claim 16, wherein the host is livestock, primates, humans, 
canines, felines, rodents, birds, reptiles, fish, or amphibians. 

19. A method for treating or preventing a lipid-mediated disorder in an animal 
comprising transferring a nucleic acid sequence comprising SEQ ID NO: 2 or a Zmaxl 
nucleic acid comprising a polymorphism of Table 4 into a somatic cell or a germ-line cell 
of an animal suffering from a lipid-mediated disorder. 

20. The method of claim 19, wherein the animal is livestock, primates, humans, 
canines, felines, rodents, birds, reptiles, fish, or amphibians. 

21 . A method of treating or preventing arteriosclerosis or an arteriosclerosis- 
associated condition comprising adrninistering an amino acid sequence comprising SEQ ID 
NO: 4 to a patient in need thereof. 
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22. The method of claim 21, wherein the patient is livestock, primates, humans, 
canines, felines, rodents, birds, reptiles, fish, or amphibians. 

23 The method of claim 21 , wherein the amino acid sequence administered to a 
patient in need thereof comprises the extracellular domain of the amino acid sequence 
comprising SEQ ID NO: 4. 

24. The method of claim 21, wherein the amino acid sequence administered to a 
patient in need thereof comprises the intracellular domain of the amino acid sequence 
comprising SEQ ID NO: 4. 

25. A method for treating or preventing a lipid-mediated disorders comprising 
administering a molecule that binds to a nucleic acid sequence comprising SEQ ID NO: 2 or 
a Zmaxl nucleic acid comprising a polymorphism of Table 4 to a patient in need thereof. 

26. The method of claim 25, wherein the patient is livestock, primates, humans, 
canines, felines, rodents, birds, reptiles, fish, or amphibians. 

27 A method for treating or preventing lipid-mediated disorders comprising 
- admimstermga^an^ 

acid sequence comprising SEQ ID NO: 4. 

28 A method for diagnostic screening for a' genetic predisposition to 
arteriosclerosis or an arteriosclerosis associated condition or a lipid-mediated disorder 
comprising screening a sample from a patient Tvith a nucleotide sequence derived fern the 
genomic or cDNA nucleic acid sequence of HBM. 
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29. The method of claim 28, wherein the screening involves, performing a 
haplotype analysis using the nucleic acid sequence comprising SEQ ID NO: 2 and 
detenriining whether the subject contains the Zmaxl gene or lacks an HBM polymorphism. 

30. A diagnostic assay for detennining a predisposition for a lipid-mediated 
disorders comprising an antibody to the HBM protein and an antibody to the Zmaxl protein. 

31. A method of expressing the HBM protein in tissue comprising constructing an 
expression vector comprising a promoter that directs expression in tissue operably linked to 
SEQ ID NO:2 and the tissue in which the HBM protein is expressed is a lipid regulating cell 
or a cell involved in lipid metabolism. 

32. The method of claim 3 1, wherein the tissue is liver. 

33. The method of claim 3 1, wherein the promoter that directs expression in tissue 
is an osteocalcin promoter or an AML-3 promoter. 

34. A method of modulating lipid levels in a subject by administering an HBM 
protein or a Zmaxl protein comprising a polymorphism of Table 4. 

35 . The method of claim 34, wherein the HBM protein comprises SEQ ID NO: 4. 

36. The method of claim34, wherein the lipid modulated is selected from the 



group 



consisting of: VLDL, LDL T IDL, HDL, LIPOa, APO A-l, APO B and APO E. 



37. . A method of modulating lipid levels in a subject by administering an agent 
which regulates HBM or Zmaxl activity. 
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38 The method of claim 37, wherein the lipid modulated is selected from the 
group consisting of: VLDL, LDL, IDL, HDL, LXPOa, APO A-l. APO B and APO E. 

39 The method of claim 37, wherein the regulation of HBM or Zmaxl activity is 
modulates gene transcription, protein translation or Zmaxl or HBM protein binding to its 
cognate target thereby regulating lipid levels. 

40 ' A composition for treating a lipid-mediated condition comprising an agent that 
modulates Upid levels by regulating Zmaxl or HBM activity and a lipoprotein modulating 
agent with a pharmaceutically acceptable carrier. 

7 41 Thecompositionofclaim40 5 wheremthelipoprotemmodnlatingagentis 
blofibrate, gemfibrozil, nicotinic acid, cholestyramine, colestipol, lovastatin, simvastatin, 
pravastatin, probucol, premarin or estradiol. 

42. The composition of claim 40, wherein the lipoprotein modulating agent 
modulates LDL levels. 

43 The composition of claim 42, wherein the lipoprotein modulating agent is 
selected from *e group consisting ofbile acid binding resins, HMG-CoA reductase inhibitors 

and estrogens. 

44. A meliod of treating a subject suffering from a lipid-mediated condition 
comprising the step of administering the composition of claim 40. 

45. The method of claim 44, wherein the lipid-mediated condition is 
atherosclerosis, arteriosclerosis, or a disease associated with atherosclerosis or 
arteriosclerosis. 
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46. A combination therapy for treating a subject suffering from a lipid-mediated 
disease or condition comprising administering to a subject an agent which regulates HBM or 
Zmaxl and an agent which regulates a lipoprotein. 

47. The combination therapy of claim 46, wherein the agent regulating lipoprotein 
concentrations is blofibrate, gemfibrozil, nicotinic acid, cholestyrarnine, cholestipol, 
lovastatin, simvastatin, pravastain, probucol, premarin or estradiol. 

48. The method of claim 46, wherein tbe lipid-mediated disease is atherosclerosis, 
arteriosclerosis, an atherosclerosis associated condition or an arteriosclerosis associated . 
condition. 
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Exon 1 

ACTAAAGCGCCGCCGCCGCGCCATGGAGCCCGAGTGAGCGCGGCGCGG 
GCCCGTCCGGCCGCCGGACAACA1SGAGGCAGCGCCGCCCGGGCCGCC 
GTGGCCGCTGCTGCTGCTGCTGCTGCTGCTGCTGGCGCTGTGCGGCTGC 

CCGGCCCCCGCCGCGGCC 

Exon 2 Coordinates: 527dl2_Contig308G 30944-30549 

eccccacagCCTCGCCGCTCCTGCTATTTGCCAACCGCCGGGACGTACGGCT 

GGTGGACGCCGGCGGAGTCAAGCTGGAGTCCACCATCGTGGTCAGCGG 

GCTGGAGGATGCGGCCGCAGTGGACTTCCAGTTTTCCAAGGGAGCCGTG 

TACTGGACAGACGTGAGCGAGGAGGCCATCAAGCAGACCTACCTGAACC 

AGACGGGGGCCGCCGTGCAGAACGTGGTCATCTCCGGCCTGGTCTCTCC 

CGACGGCCTCGCCTGCGACTGGGTGGGCAAGAAGCTGTACTGGACGGA 

CTCAGAGACCAACCGCATCGAGGTGGCCAACCTCAATGGCACATCCCGG 

AAGGTGCTCTTCTGGCAGGACCTTGACCAGCCGAGGGCCATCGCCTTGG 

ACCCCGCTCACGGgtaaaccctgctg 
... 9408 nt ... 

Exon 3 Coordinates: 527dl2_Contig308G 21141-20945 

ccccgtcacagGTACATGTACTGGACAGACTGGGGTGAGACGCCCCGGATTGA 
GCGGGCAGGGATGGATGGCAGCACCCGGAAGATCATTGTGGACTCGGA 
CATTTACTGGCCCAATGGACTGACCATCGACCTGGAGGAGCAGAAGCTC 
TACTGGGCTGACGCCAAGCTCAGCTTCATCCACCGTGCCAACCTGGACG 

GCTCGTTCCGgtaggtacccac 
... 6094 nt ... 



Exon 4 Coordinates: 527dl2_Contig308G 15047-14850 

tccctgactgcagGCAGAAGGTGGTGGAGGGCAGCCTGACGCACCCCTTCGCCC 
TGACGCTCTCCGGGGACACTCTGTACTGGACAGACTGGCAGACCCGCTC 
CATCCATGCCTGCAACAAGCGCACTGGGGGGAAGAGGAAGGAGATCCTG 
AGTGCCCTATACTCACCCATGGACATCCAGGTGCTGAGCCAGGAGCGGC 



AGCCTTTCTgtgagtgccgg 
... 1827 nt... 



Exon 5 Coordinates: 527dl2_Contig308G 13220-13088 

tttctcagTCCACACTCGCTGTGAGGAGGACAATGGCGGCTGCTCCCACCTGT 
GCCTGCTGTCCCCAAGCGAGCCTTTCTACACATGCGCCTGCCCCACGGG 
TGTGCAGCTGCAGGACAACGGCAGGACGTGTAAGGCAGgtgaggcggtgggacg 

FIG. 3A 
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... 20923 nt ... 

Exon 6 Coordinates: 527dl2_Contig309G 7705-8100 

ctccacagGAGCCGAGGAGGTGCTGCTGCTGGCCCGGCGGACGGACCTACG 

GAGGATCTCGCTGGACACGCCGGACTTCACCGACATCGTGCTGCAGGTG 

GACGACATCCGGCACGCCATTGCCATCGACTACGACCCGCTAGAGGGCT 

ATGTCTACTGGACAGATGACGAGGTGCGGGCCATCCGCAGGGCGTACCT 

GGACGGGTCTGGGGCGCAGACGCTGGTCAACACCGAGATCAACGACCC 

CGATGGCATCGCGGTCGACTGGGTGGCCCGAAACCTCTACTGGACCGAC 

ACGGGCACGGACCGCATCGAGGTGACGCGCCTCAACGGCACCTCCCGCA 

AGATCCTGGTGTCGGAGGACCTGGACGAGCCCCGAGCCATCGCACTGCA 

CCCCGTGATGGGgtaagacgggc 
3211 nt 



Exon 7 Coordinates: 527dl2_Contig309G 11311-11482 

ttcttctccagCCTCATGTACTGGACAGACTGGGGAGAGAACCCTAAAATCGAG 
TGTGCCAACTTGGATGGGCAGGAGCGGCGTGTGCTGGTCAATGCCTCCC 
TCGGGTGGCCCAACGGCCTGGCCCTGGACCTGCAGGAGGGGAAGCTCT 
ACTGGGGAGACGCCAAGACAGACAAGATCGAGgtgaggctcctgtgg 



13445 nt 



Exon 8 Coordinates: 527dl2_Contig309G 24927-25143 

ccgtcctgcagGTGATCAATGTTGATGGGACGAAGAGGCGGACCCTCCTGGAG 

GACAAGCTCCCGCACATTTTCGGGTTCACGCTGCTGGGGGACTTCATCT 

ACTGGACTGACTGGCAGCGCCGCAGCATCGAGCGGGTGCACAAGGTCAA 

GGCCAGCCGGGACGTCATCATTGACCAGCTGCCCGACCTGATGGGGCTC 

AAAGCTGTGAATGTGGCCAAGGTCGTCGgtgagtccggggggtc 

....2826 nt 



Exon 9 Coordinates: 527dl2_Contig309G 27969-28256 

BttcgcttccagGAACCAACCCGTGTGCGGACAGGAACGGGGGGTGCAGCCACC 

TGTGCTTCTTCACACCCCACGCAACCCGGTGTGGCTGCCCCATCGGCCT 

GGAGCTGCTGAGTGACATGAAGACCTGCATCGTGCCTGAGGCCTTCTTG 

GTCTTCACCAGCAGAGCCGCCATCCACAGGATCTCCCTCGAGACCAATA 

ACAACGACGTGGCCATCCCGCTCACGGGCGTCAAGGAGGCCTCAGCCCT 

GGACTTTGATGTGTCCAACAACCACATCTACTGGACAGACGTCAGCCTG 

AAGgtagcgtgggc 
3102 

FIG. 3B 
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Exon 10 Coordinates: 527dl2_Contig309G 31358-31582 

cctgctgccagACCATGAGCCGCGCCTTCATGAACGGGAGCTCGGTGGAGCAC 

GTGGTGGAGTTTGGCCTTGACTACCCCGAGGGCATGGCCGTTGACTGGA 

TGGGCAAGAACCTCTACTGGGCCGACACTGGGACCAACAGAATCGAAGT 

GGCGCGGCTGGACGGGCAGTTCCGGCAAGTCCTCGTGTGGAGGGACTT 

GGACAACCCGAGGTCGCTGGCCCTGGATCCCACCAAGGGgtaagtgtttgcctgtc 

1297 nt 

Exon 11 Coordinates: 527dl2_Contig309G 32879-33064 

gtgccttccagCTACATCTACTGGACCGAGTGGGGCGGCAAGCCGAGGATCGT 

GCGGGCCTTCATGGACGGGACCAACTGCATGACGCTGGTGGACAAGGTG 

GGCCGGGCCAACGACCTCACCATTGACTACGCTGACCAGCGCCTCTACT 

GGACCGACCTGGACACCAACATGATCGAGTCGTCCAACATGCTGGgtgaggg 

ccgggct 

2069 nt 

Exon 12 Coordinates: 527dl2_Contig309G 35133-35454 

gtgttcatgcagGTCAGGAGCGGGTCGTGATTGCCGACGATCTCCCGCACCCGT 

TCGGTCTGACGCAGTACAGCGATTATATCTACTGGACAGACTGGAATCT 

GCACAGCATTGAGCGGGCCGACAAGACTAGCGGCCGGAACCGCACCCTC 

ATCCAGGGCCACCTGGACTTCGTGATGGACATCCTGGTGTTCCACTCCT 

CCCGCCAGGATGGCCTCAATGACTGTATGCACAACAACGGGCAGTGTGG 

GCAGCTGTGCCTTGCCATCCCCGGCGGCCACCGCTGCGGCTGCGCCTCA 

CACTACACCCTGGACCCCAGCAGCCGCAACTGCAGCCgtaagtgcctcatggt 

2006 nt 

Exon 13 Coordinates: 527dl2_Contig309G 37460-37659 

gcctcctctaCGCCCACCACCTTCTTGCTGTTCAGCCAGAAATCTGCCATCAGT 

CGGATGATCCCGGACGACCAGCACAGCCCGGATCTCATCCTGCCCCTGC 

ATGGACTGAGGAACGTCAAAGCCATCGACTATGACCCACTGGACAAGTT 

CATCTACTGGGTGGATGGGCGCCAGAACATCAAGCGAGCCAAGGACGAC 

GGGACCCAGgcaggtgccctgtgg 

......6965 nt 



FIG- 3C 
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Exon 14 Coordinates: 527dl2_Contig309G 44624-44832 

ctttgtcttacagCCCTTTGTTTTGACCTCTCTGAGCCAAGGCCAAAACCCAGACA 
GGCAGCCCCACGACCTCAGCATCGACATCTACAGCCGGACACTGTTCTG 
GACGTGCGAGGCCACCAATACCATCAACGTCCACAGGCTGAGCGGGGAA 
GCCATGGGGGTGGTGCTGCGTGGGGACCGCGACAAGCCCAGGGCCATC 
GTCGTCAACGCGGAGCGAGGgtaggaggccaac 



.1404 nt. 



Exon 15 Coordinates: 527dl2_Contig309G 46236-46427 

ccaccctcccgcagGTACCTGTACTTCACCAACATGCAGGACCGGGCAGCCAAGA 
TCGAACGCGCAGCCCTGGACGGCACCGAGCGCGAGGTCCTCTTCACCAC 
CGGCCTCATCCGCCCTGTGGCCCTGGTGGTGGACAACACACTGGGCAAG 
CTGTTCTGGGTGGACGCGGACCTGAAGCGCATTGAGAGCTGTGACCTGT 

CAGgtacgcgccccgg 
686 nt 



Exon 16 Coordinates: 527dl2_Contig309G 47113-47322 

ggctgcttgcagGGGCCAACCGCCTGACCCTGGAGGACGCCAACATCGTGCAGC 

CTCTGGGCCTGACCATCCTTGGCAAGCATCTCTACTGGATCGACCGCCA 

GCAGCAGATGATCGAGCGTGTGGAGAAGACCACCGGGGACAAGCGGAC 

TCGCATCCAGGGCCGTGTCGCCCACCTCACTGGCATCCATGCAGTGGAG 

GAAGTCAGCCTGGAGGAGTTCTgtacgtgggggc 



.3884 nt. 



Exon 17 Coordinates: 527dl2_Contig309G 51206-51331 

ttgtctttgcagCAGCCCACCCATGTGCCCGTGACAATGGTGGCTGCTCCCACAT 
CTGTATTGCCAAGGGTGATGGGACACCACGGTGCTCATGCCCAGTCCAC 
CTCGTGCTCCTGCAGAACCTGCTGACCTGTGGAGgtaggtgtgacctaggtgc 



.3905 nt. 



Exon 18 Coordinates: 527dl2_Contig309G 55236-55472 

gttctcctctgtccctcccccagAGCCGCCCACCTGCTCCCCGGACCAGTTTGCATGTG 

CCACAGGGGAGATCGACTGTATCCCCGGGGCCTGGCGCTGTGACGGCTT 

TCCCGAGTGCGATGACCAGAGCGACGAGGAGGGCTGCCCCGTGTGCTCC 

GCCGCCCAGTTCCCCTGCGCGCGGGGTCAGTGTGTGGACCTGCGCCTGC 

GCTGCGACGGCGAGGCAGACTGTCAGGACCGCTCAGACGAGGTGGACT 

GTGACGgtgaggccctcc 



.3052 nt 

FIG. 3D 
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Exon 19 Coordinates: 527dl2_Contig309G 58524-58634 

tctccttgcagCCATCTGCCTGCCCAACCAGTTCCGGTGTGCGAGCGGCCAGTG 

TGTCCTCATCAAACAGCAGTGCGACTCCTTCCCCGACTGTATCGACGGCT 

CCGACGAGCTCATGTGTGgtgagccagctt 

1448 nt 



Exon 20 Coordinates: 527dl2_Contig309G. 60082-60319 

gtttgtctctggcagAAATCACCAAGCCGCCCTCAGACGACAGCCCGGCCCACAGC 
AGTGCCATCGGGCCCGTCATTGGCATCATCCTCTCTCTCTTCGTCAT? GG 
TGGTGTCTATTTTGTGTGCCAGCGCGTGGTGTGCCAGCGCTATGCGC JG 
GCCAACGGGCCCTTCCCGCACGAGTATGTCAGCGGGACCCCGCACGTGC 
CCCTCAATTTCATAGCCCCGGGCGGTTCCCAGCATGGCCCCTTCACAGgta 
aggagcctgagatatggaa 



....1095 nt 

Exon 21 Coordinates: 527dl2_Contig309G 61414-61552 

cttccctgccagGCATCGCATGCGGAAAGTCCATGATGAGCTCCGTGAGCCTGA 
TGGGGGGCCGGGGCGGGGTGCCCCTCTACGACCGGAACCACGTCACAG 
GGGCCTCGTCCAGCAGCTCGTCCAGCACGAAGGCCACGCTGTACCCGCC 

Ggtgaggggcggg 



6513nt 

Exon 22 Coordinates: 527dl2_Contig309G 68065-68162 

ttggctctcctcagATCCTGAACCCGCCGCCCTCCCCGGCCACGGACCCCTCCCT 
GTACAACATGGACATGTTCTACTCTTCAAACATTCCGGCCACTGCGAGAC 

CGTACAGgtaggacatcccctgcag 



2273 nt 

FIG. 3E 
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Exon 23 Coordinates: 527dl2 Contig309G 70435-70901 

tcaaacattccggccactgcgagaccgtacagGCCCTACATCATTCGAGGAATGGCGCCCCC 

GA^GA^G^CCTGCAGCACCGACGTGTGTGACAGCGACTACAGCGCCAGC 
CGCTGGAAGGCCAGCAAGTACTACCTGGATTTGAACTCGGACTCAGACC 
CCTATCCACCCCCACCCACGCCCCACAGCCAGTACCTGTCGGCGGAGGA 
CAGCTGCCCGCCCTCGCCCGCCACCGAGAGGAGCTACTTCCATCTCTTC 
CCGCCCCCTCCGTCCCCCTGCACGGACTCATCCTGACCTCGGCCGGOCCA 
r.Tr.TGGCTTrTrTGTGCr.rrTGTAA A TAGTTTTAAATATGAACAAAGAAAAAA 
ATATATTT TATGATTTAA A A A ATAA A TATA ATTGGGATTTTAA A A ACATGAGA 
A ATOTGA ACTGTnATOOGGTGG nr A r T r^r.TGGG A G A ACTTTGTACAGTGOAG 
A A ATATTTA T A A ACTTA A TTTTGTA A AACA 

FIG. 3F 
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gac tea gag acc aac cgc ate gag gtg gcc aac etc aat ggc aca tec 493 
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<212> DNA 
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<400> 1 

actaaagcgc cgccgccgcg ccatggagcc cgagtgagcg cggcgcgggc ccgtccggcc 60 
gccggacaac atg gag gca gcg ccg ccc ggg ccg ccg tgg ccg ctg ctg 109 
Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu 
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Asp Ser Glu Thr Asn Axg lie Glu Val Ala Asn Leu Asn Gly Thr Ser 

130 135 140 

egg aag gtg etc ttc tgg cag gac ctt gac cag ccg agg gee ate gec 541 
Arg Lys Val Leu Phe Trp Gin Asp Leu Asp Gin Pro Arg Ala lie Ala 

145 150 155 

ttg gac ccc get cac ggg tac atg tac tgg aca gac tgg ggt gag acg 589 
Leu Asp Pro Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Gly Glu Thr 

160 165 170 

ccc egg art gag egg gca ggg atg gat ggc age ace egg aag ate att 637 
Pro Arg He Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys lie He 

175 180 185 

gtg gac teg gac att tac tgg ccc aat gga ctg acc ate gac ctg gag 685 
Val Asp Ser Asp He Tyr Trp Pro Asn Gly Leu Thr He Asp Leu Glu 
190 195 200 205 

gag cag aag etc tac tgg get gac gec aag etc age ttc ate cac cgt 733 
Glu Gin Lys Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe Ee His Arg 

210 215 220 

gee aac ctg gac ggc teg ttc egg cag aag gtg gtg gag ggc age ctg 78 1 
Ala Asn Leu Asp Gly Ser Phe Arg Gin Lys Val Val Glu Gly Ser Leu 

225 230 235 

acg cac ccc ttc gec ctg acg etc tec ggg gac act ctg tac tgg aca 829 
Thr His Pro Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr 
240 245 250 



WO 01/92891 PCTAJS01/16946 



gac tgg cag acc cgc tec ate cat gec tgc aac aag cgc act ggg ggg 877 
Asp Trp Gin Thr Arg Ser He His Ala Cys Asn Lys Arg Thr Gly Gly 

255 260 265 

aag agg aag gag ate ctg agt gec etc tac tea ccc atg gac ate cag 925 
Lys Arg Lys Glu He Leu Ser Ala Leu Tyr Ser Pro Met Asp lie Gin 
270 275 280 285 

gtg ctg age cag gag egg cag cct ttc ttc cac act cgc tgt gag gag 973 
Val Leu Ser Gin Glu Arg Gin Pro Phe Phe His Thr Arg Cys Glu Glu 

290 295 300 

gac aat ggc ggc tgc tec cac ctg tgc ctg ctg tec cca age gag cct 1021 
Asp Asn Gly Gly Cys Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro 

305 310 315 

ttc tac aca tgc gec tgc ccc acg ggt gtg cag ctg cag gac aac ggc 1069 
Phe Tyr Thr Cys Ala Cys Pro Thr Gly Val Gin Leu Gin Asp Asn Gly 

320 325 330 

agg acg tgt aag gca gga gec gag gag gtg ctg ctg ctg gee egg egg 1117 
Arg Thr Cys Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg 

335 340 345 

acg gac eta egg agg ate teg ctg gac acg ccg gac ttc acc gac ate 1 165 
Thr Asp Leu Arg Arg He Ser Leu Asp Thr Pro Asp Phe Thr Asp lie 
350 355 360 365 

gtg ctg cag gtg gac gac ate egg cac gee att gec ate gac tac gac 1213 
Val Leu Gin Val Asp Asp He Arg His Ala He Ala He Asp Tyr Asp 
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370 375 380 

ccg eta gag ggc tat gtc tac tgg aca gat gac gag gtg egg gec ate 1261 
Pro Leu Glu Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala lie 

385 390 395 

cgc agg gcg tac ctg gac ggg tct ggg gcg cag acg ctg gtc aac acc 1309 
Arg Arg Ala Tyr Leu Asp Gly Ser Gly Ala Gin Thr Leu Val Asn Thr 

400 405 410 

gag ate aac gac ccc gat ggc ate gcg gtc gac tgg gtg gee cga aac 1357 
Glu He Asn Asp Pro Asp Gly lie Ala Val Asp Trp Val Ala Arg Asn 

415 420 425 

etc tac tgg acc gac acg ggc acg gac cgc ate gag gtg acg cgc etc 1405 
Leu Tyr Trp Thr Asp Thr Gly Thr Asp Arg He Glu Val Thr Arg Leu 
430 435 440 445 

aac ggc acc tec cgc aag ate ctg gtg teg gag gac ctg gac gag ccc 1453 
Asn Gly Thr Ser Arg Lys He Leu Val Ser Glu Asp Leu Asp Glu Pro 

450 455 460 

cga gee ate gca ctg cac ccc gtg atg ggc etc atg tac tgg aca gac 1501 
Arg Ala lie Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp 

465 470 475 

tgg gga gag aac cct aaa ate gag tgt gee aac ttg gat ggg cag gag 1549 
Trp Gly Glu Asn Pro Lys He Glu Cys Ala Asn Leu Asp Gly Gin Glu 

480 485 490 

egg cgt gtg ctg gtc aat gee tec etc ggg tgg ccc aac ggc ctg gee 1597 
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Arg Arg Val Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala 

495 500 505 

ctg gac ctg cag gag ggg aag etc tac tgg gga gac gec aag aca gac 1645 
Leu Asp Leu Gin Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp 
510 515 520 525 

aag ate gag gtg ate aat gtt gat ggg acg aag agg egg acc etc ctg 1693 
Lys He Glu Val He Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu 

530 535 540 

gag gac aag etc ccg cac att ttc ggg ttc acg ctg ctg ggg gac ttc 1741 
Glu Asp Lys Leu Pro His He Phe Gly Phe Thr Leu Leu Gly Asp Phe 

545 550 555 

ate tac tgg act gac tgg cag cgc cgc age ate gag egg gtg cac aag 1789 
He Tyr Trp Thr Asp Trp Gin Arg Arg Ser He Glu Arg Val His Lys 

560 565 570 

gtc aag gec age egg gac gtc ate att gac cag ctg ccc gac ctg atg 1837 
Val Lys Ala Ser Arg Asp Val He He Asp Gin Leu Pro Asp Leu Met 

575 580 585 

ggg etc aaa get gtg aat gtg gec aag gtc gtc gga acc aac ccg tgt 1885 
Gly Leu Lys Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys 
590 595 600 605 

gcg gac agg aac ggg ggg tgc age cac ctg tgc ttc ttc aca ccc cac 1933 
Ala Asp Arg Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His 

610 615 620 
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gca acc egg tgt ggc tgc ccc ate ggc ctg gag ctg ctg agt gac atg 1981 
Ala Thr Arg Cys Gly Cys Pro He Gly Leu Glu Leu Leu Ser Asp Met 

625 630 635 

aag acc tgc ate gtg cct gag gee ttc ttg gtc ttc acc age aga gec 2029 
Lys Thr Cys lie Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala 

640 645 650 

gee ate cac agg ate tec etc gag acc aat aac aac gac gtg gec ate 2077 
Ala He His Arg lie Ser Leu Glu Thr Asn Asn Asn Asp Val Ala He 

655 660 665 

ccg etc acg ggc gtc aag gag gec tea gee ctg gac ttt gat gtg tec 2125 
Pro Leu Thr Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser 
670 675 680 685 

aac aac cac ate tac tgg aca gac gtc age ctg aag acc ate age cgc 2173 
Asn Asn His He Tyr Trp Thr Asp Val Ser Leu Lys Thr He Ser Arg 

690 695 700 

gee ttc atg aac ggg age teg gtg gag cac gtg gtg gag ttt ggc ctt 2221 
Ala Phe Met Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu 

705 710 715 

gac tac ccc gag ggc atg gec gtt gac tgg atg ggc aag aac etc tac 2269 
Asp Tyr Pro Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr 

720 725 730 

tgg gee gac act ggg acc aac aga ate gaa gtg gcg egg ctg gac ggg 23 17 
Trp Ala Asp Thr Gly Thr Asn Arg He Glu Val Ala Arg Leu Asp Gly 
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735 740 745 

cag ttc egg caa gtc etc gtg tgg agg gac ttg gae aae eeg agg teg 2365 
Gin Phe Axg Gin Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser 
750 755 760 765 

etg gee etg gat eec aee aag gge tae ate tae tgg aee gag tgg gge 2413 
Leu A>a Leu Asp Pro Thr Lys G.y Tyr Be Tyr Trp Thr Glu Trp G.y 

770 775 780 

gge aag eeg agg ate gtg egg gee ttc atg gae ggg aee aac tgc atg 2461 
Gly Lys Pro Arg He Va. Arg A.a Phe Me, Asp Gly Tnr Asn Cys Met 

785 790 795 

aeg ctg gtg g- aag gtg gge egg gee aae gae etc ace at. gac tae 2509 
Leu Va, Asp Lys Val Gly Arg Ala Asn Asp Leu Thr Be Asp Tyr 
800 805 810 

ge, gae eag egc etc tae tgg aee gae etg gac aee aae atg ate gag 2557 

815 820 825 

ttg tee aae atg ctg ggt cag gag egg gtc gtg att gee gae ga, etc 2605 
Ser Ser Asa Met Leu Gly Gin Glu Arg Val Val Be Ala Asp Asp Leu 
830 835 840 845 

eeg cac ccg ttc ggt etg acg eag tac age gat tat ate tae tgg aea 2653 
Pro His Pro Phe Gly Leu Thr Gln Tyr Ser Asp Tyr Be Tyr Trp Thr 

850 855 860 

gae tgg aa, ctg cac age att gag egg gee gae aag act age gge egg 2701 
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Asp Tip Asn Leu His Ser lie Glu Arg Ala Asp Lys Thr Ser Gly Arg 

865 870 875 

aac cgc acc etc ate cag ggc cac ctg gac ttc gtg atg gac ate ctg 2749 
Asn Arg Thr Leu He Gin Gly His Leu Asp Phe Val Met Asp He Leu 

880 885 890 

gtg ttc cac tec tec cgc cag gat ggc etc aat gac tgt atg cac aac 2797 
Val Phe His Ser Ser Arg Gin Asp Gly Leu Asn Asp Cys Met His Asn 

895 900 905 

aac ggg cag tgt ggg cag ctg tgc ctt gec ate ccc ggc ggc cac cgc 2845 
Asn Gly Gin Cys Gly Gin Leu Cys Leu Ala He Pro Gly Gly His Arg 
910 915 920 925 

tgc ggc tgc gee tea cac tac acc ctg gac ccc age age cgc aac tgc 2893 
Cys Gly Cys Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys 

930 935 940 

age ccg ccc acc acc ttc ttg ctg ttc age cag aaa tct gee ate agt 2941 
Ser Pro Pro Thr Thr Phe Leu Leu Phe Ser Gin Lys Ser Ala He Ser 

945 950 955 

egg atg ate ccg gac gac cag cac age ccg gat etc ate ctg ccc ctg 2989 
Arg Met He Pro Asp Asp Gin His Ser Pro Asp Leu He Leu Pro Leu 

960 965 970 

cat gga ctg agg aac gtc aaa gee ate gac tat gac cca ctg gac aag 3037 
His Gly Leu Arg Asn Val Lys Ala He Asp Tyr Asp Pro Leu Asp Lys 
975 980 985 
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ttc ate tac tgg gtg gat ggg cgc cag aac ate aag cga gee aag gac 3085 
Phe He Tyr Tip Val Asp Gly Arg Gin Asn lie Lys Arg Ala Lys Asp 
990 995 1000 1005 

gac ggg acc cag ccc ttt gtt ttg ace tct ctg age caa ggc caa aac 3 133 
Asp Gly Thr Gin Pro Phe Val Leu Thr Ser Leu Ser Gin Gly Gin Asn 

1010 1015 1020 

cca gac agg cag ccc cac gac etc age ate gac ate tac age egg aca 3181 
Pro Asp Arg Gin Pro His Asp Leu Ser He Asp He Tyr Ser Arg Thr 

1025 1030 1035 

ctg ttc tgg acg tgc gag gee acc aat acc ate aac gtc cac agg ctg 3229 
Leu Phe Trp Thr Cys Glu Ala Thr Asn Thr He Asn Val His Arg Leu 

1040 1045 1050 

age ggg gaa gec atg ggg gtg gtg ctg cgt ggg gac cgc gac aag ccc 3277 
Ser Gly Glu Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro 

1055 1060 1065 

agg gec ate gtc gtc aac gcg gag cga ggg tac ctg tac ttc acc aac 3325 
Arg Ala lie Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn 
1070 1075 1080 1085 

atg cag gac egg gca gec aag ate gaa cgc gca gee ctg gac ggc acc 3373 
Met Gin Asp Arg Ala Ala Lys He Glu Arg Ala Ala Leu Asp Gly Thr 

1090 1095 1100 

gag cgc gag gtc etc ttc acc acc ggc etc ate cgc cct gtg gec ctg 3421 
Glu Arg Glu Val Leu Phe Thr Thr Gly Leu He Arg Pro Val Ala Leu 
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1105 1110 1115 

gtg gtg gac aac aca ctg ggc aag ctg ttc tgg gtg gac gcg gac ctg 3469 
Val Val Asp Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu 

1120 1125 1130 

aag cgc att gag age tgt gac ctg tea ggg gec aac cgc ctg acc ctg 3517 
Lys Arg lie Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu 

1135 1140 1145 

gag gac gec aac ate gtg cag cct ctg ggc ctg acc ate ctt ggc aag 3565 
Glu Asp Ala Asn lie Val Gin Pro Leu Gly Leu Thr lie Leu Gly Lys 
1150 1155 1160 1165 

cat etc tac tgg ate gac cgc cag cag cag atg ate gag cgt gtg gag 3613 
His Leu Tyr Trp He Asp Arg Gin Gin Gin Met He Glu Arg Val Glu 

1170 1175 1180 

aag acc acc ggg gac aag egg act cgc ate cag ggc cgt gtc gec cac 3661 
Lys Thr Thr Gly Asp Lys Arg Thr Arg lie Gin Gly Arg Val Ala His 

1185 1190 1195 

etc act ggc ate cat gca gtg gag gaa gtc age ctg gag gag ttc tea 3709 
Leu Thr Gly He His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser 

1200 1205 1210 

gec cac cea tgt gec cgt gac aat ggt ggc tgc tec cac ate tgt att 3757 
Ala His Pro Cys Ala Arg Asp Asn Gly Gly Cys Ser His He Cys He 

1215 1220 1225 

gec aag ggt gat ggg aca cea egg tgc tea tgc cea gtc cac etc gtg 3805 
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Ala Lys Gly Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val 
1230 1235 1240 1245 

etc ctg cag aac ctg ctg acc tgt gga gag ccg ccc acc tgc tec ccg 3853 
Leu Leu Gin Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro 

1250 1255 1260 

gac cag ttt gca tgt gec aca ggg gag ate gac tgt ate ccc ggg gee 3901 
Asp Gin Phe Ala Cys Ala Thr Gly Glu lie Asp Cys He Pro Gly Ala 

1265 1270 1275 

tgg cgc tgt gac ggc ttt ccc gag tgc gat gac cag age gac gag gag 3949 
Trp Arg Cys Asp Gly Phe Pro Glu Cys Asp Asp Gin Ser Asp Glu Glu 

1280 1285 1290 

ggc tgc ccc gtg tgc tec gec gee cag ttc ccc tgc gcg egg ggt cag 3997 
Gly Cys Pro Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin 

1295 1300 1305 

tgt gtg gac ctg cgc ctg cgc tgc gac ggc gag gca gac tgt cag gac 4045 
Cys Val Asp Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gin Asp 
1310 1315 1320 1325 

cgc tea gac gag gtg gac tgt gac gec ate tgc ctg ccc aac cag ttc 4093 
Arg Ser Asp Glu Val Asp Cys Asp Ala lie Cys Leu Pro Asn Gin Phe 

1330 1335 1340 

egg tgt gcg age ggc cag tgt gtc etc ate aaa cag cag tgc gac tec 4141 
Arg Cys Ala Ser Gly Gin Cys Val Leu lie Lys Gin Gin Cys Asp Ser 
1345 1350 1355 
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ttc ccc gac tgt ate gac ggc tec gac gag etc atg tgt gaa ate ace 4189 
Phe Pro Asp Cys lie Asp Gly Ser Asp Glu Leu Met Cys Glu He Thr 

1360 1365 1370 

aag ccg ccc tea gac gac age ccg gee cac age agt gec ate ggg ccc 4237 
Lys Pro Pro Ser Asp Asp Ser Pro Ala His Ser Ser Ala lie Gly Pro 

1375 1380 1385 

gtc att ggc ate ate etc tct etc ttc gtc atg ggt ggt gtc tat ttt 4285 
Val He Gly lie He Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe 
1390 1395 1400 1405 

gtg tgc cag cgc gtg gtg tgc cag cgc tat gcg ggg gee aac ggg ccc 4333 
Val Cys Gin Arg Val Val Cys Gin Arg Tyr Ala Gly Ala Asn Gly Pro 

1410 1415 1420 

ttc ccg cac gag tat gtc age ggg ace ccg cac gtg ccc etc aat ttc 4381 
Phe Pro His Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe 

1425 1430 1435 

ata gee ccg ggc ggt tec cag cat ggc ccc ttc aca ggc ate gca tgc 4429 
He Ala Pro Gly Gly Ser Gin His Gly Pro Phe Thr Gly He Ala Cys 

1440 1445 1450 

gga aag tec atg atg age tec gtg age ctg atg ggg ggc egg ggc ggg 4477 
Gly Lys Ser Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly 

1455 1460 1465 

gtg ccc etc tac gac egg aac cac gtc aca ggg gee teg tec age age 4525 
Val Pro Leu Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser 
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1470 1475 1480 1485 

teg tec age acg aag gee acg ctg tac ccg ccg ate ctg aac ccg ccg 4573 

Ser Ser Ser Thr Lys Ala Thx Leu Tyr Pro Pro lie Leu Asn Pro Pro 

1490 1495 1500 

ccc tec ccg gee acg gac ccc tec ctg tac aac atg gac atg ttc tac 4621 
Pro Ser Pro Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr 

1505 1510 1515 

tct tea aac att ccg gec act gcg aga ccg tac agg ccc tac ate art 4669 
Ser Ser Asn lie Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr He lie 

1520 1525 1530 

cga gga atg gcg ccc ccg acg acg ccc tgc age acc gac gtg tgt gac 4717 
Arg Gly Met Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp 

1535 1540 1545 

age gac tac age gee age cgc tgg aag gec age aag tac tac ctg gat 4765 
Ser Asp Tyr Ser Ala Ser Arg Tip Lys Ala Ser Lys Tyr Tyr Leu Asp 
1550 1555 1560 1565 

ttg aac teg gac tea gac ccc tat cca ccc cca ccc acg ccc cac age 4813 
Leu Asn Ser Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser 

1570 1575 1580 

cag tac ctg teg gcg gag gac age tgc ccg ccc teg ccc gee acc gag 4861 
Gin Tyr Leu Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu 

1585 1590 1595 

agg age tac ttc cat etc ttc ccg ccc cct ccg tec ccc tgc acg gac 4909 
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Arg Ser Tyr Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp 



1600 



1605 



1610 



tea tec tgacctcggc cgggccactc tggcttctct gtgcccctgt aaatagtttt 



4965 



Ser Ser 



1615 



aaatatgaac aaagaaaaaa atatatttta tgatttaaaa aataaatata attgggattt 5025 
taaaaacatg agaaatgtga actgtgatgg ggtgggcagg gctgggagaa ctttgtacag 5085 



<210> 2 
<211> 5120 
<212> DNA 
<213> Homo sapiens 

<400 > 2 

actaaagege cgccgccgcg ccatggagcc cgagtgagcg cggcgcgggc ccgtccggcc 60 
gccggacaac atg gag gca gcg ccg ccc ggg ccg ccg tgg ccg ctg ctg 109 
Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu 
15 10 
ctg ctg ctg ctg ctg ctg ctg gcg ctg tgc ggc tgc ccg gec ccc gec 157 
Leu Leu Leu Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala 

15 20 25 

gcg gec teg ccg etc ctg eta ttt gec aac cgc cggjjac gta egg ctg 205 



tggagaaata tttataaact taattttgta aaaca 



5120 
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Ala Ala Ser Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu 
30 35 40 45 

gtg gac gcc ggc gga gtc aag ctg gag tec acc ate gtg gtc age ggc 253 
Val Asp Ala Gly Gly Val Lys Leu Glu Ser Thr He Val Val Ser Gly 

50 55 60 

ctg gag gat gcg gcc gca gtg gac ttc cag ttt tec aag gga gcc gtg 301 
Leu Glu Asp Ala Ala Ala Val Asp Phe Gin Phe Ser Lys Gly Ala Val 

65 70 75 

tac tgg aca gac gtg age gag gag gcc ate aag cag acc tac ctg aac 349 
Tyr Trp Thr Asp Val Ser Glu Glu Ala He Lys Gin Thr Tyr Leu Asn 

80 85 90 

cag acg ggg gcc gcc gtg cag aac gtg gtc ate tec ggc ctg gtc tct 397 
Gin Thr Gly Ala Ala Val Gin Asn Val Val He Ser Gly Leu Val Ser 

95 100 105 

ccc gac ggc etc gcc tgc gac tgg gtg ggc aag aag ctg tac tgg acg 445 
Pro Asp Gly Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr 
110 115 120 125 

gac tea gag acc aac cgc ate gag gtg gcc aac etc aat ggc aca tec 493 
Asp Ser Glu Thr Asn Arg lie Glu Val Ala Asn Leu Asn Gly Thr Ser 

130 135 140 

egg aag gtg etc ttc tgg cag gac ctt gac cag ccg agg gcc ate gcc 541 
Arg Lys Val Leu Phe Trp Gin Asp Leu Asp Gin Pro Arg Ala lie Ala 
145 150 155 
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ttg gac ccc get cac ggg tac atg tac tgg aca gac tgg gtt gag acg 589 
Leu Asp Pro Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Val Glu Thr 

160 165 170 

ccc egg art gag egg gca ggg atg gat ggc age ace egg aag ate art 637 
Pro Arg He Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys lie He 

175 180 185 

gtg gac teg gac art tac tgg ccc aat gga ctg acc ate gac ctg gag 685 
Val Asp Ser Asp lie Tyr Trp Pro Asn Gly Leu Thr lie Asp Leu Glu 
190 195 200 205 

gag cag aag etc tac tgg get gac gee aag etc age ttc ate cac cgt 733 
Glu Gin Lys Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe lie His Arg 

210 215 220 

gee aac ctg gac ggc teg ttc egg cag aag gtg gtg gag ggc age ctg 781 
Ala Asn Leu Asp Gly Ser Phe Arg Gin Lys Val Val Glu Gly Ser Leu 

225 230 235 

acg cac ccc ttc gee ctg acg etc tec ggg gac act ctg tac tgg aca 829 
Thr His Pro Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr 

240 245 250 

gac tgg cag acc cgc tec ate cat gee tgc aac aag cgc act ggg ggg 877 
Asp Trp Gin Thr Arg Ser lie His Ala Cys Asn Lys Arg Thr Gly Gly 

255 260 265 

aag agg aag gag ate ctg agt gee etc tac tea ccc atg gac ate cag 925 
Lys Arg Lys Glu lie Leu Ser Ala Leu Tyr Ser Pro Met Asp He Gin 
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270 275 280 285 

gtg ctg age cag gag egg cag cct tie tie cac act cgc tgt gag gag 973 

Val Leu Ser Gin Glu Arg Gin Pro Phe Phe His Thr Arg Cys Glu Glu 

290 295 300 

gac aat ggc ggc tgc tec cac ctg tgc ctg ctg tec cca age gag cct 1021 
Asp Asn Gly Gly Cys Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro 

305 310 315 

ttc tac aca tgc gec tgc ccc acg ggt gtg cag ctg cag gac aac ggc 1069 
Phe Tyr Thr Cys Ala Cys Pro Thr Gly Val Gin Leu Gin Asp Asn Gly 

320 325 330 

agg acg tgt aag gca gga gee gag gag gtg ctg ctg ctg gee egg egg 1117 
Arg Thr Cys Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg 

335 340 345 

aeg gac eta egg agg ate teg ctg gac acg ccg gac ttc acc gac ate 1165 
Thr Asp Leu Arg Arg lie Ser Leu Asp Thr Pro Asp Phe Thr Asp lie 
350 355 360 365 

gtg ctg cag gtg gac gac ate egg cac gee att gee ate gac tac gac 1213 
Val Leu Gin Val Asp Asp He Arg His Ala lie Ala lie Asp Tyr Asp 

370 375 380 

ccg eta gag ggc tat gtc tac tgg aca gat gac gag gtg egg gec ate 1261 
Pro Leu Glu Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala He 

385 390 395 

cgc agg gcg tac ctg gac ggg tct ggg gcg cag acg ctg gtc aac acc 1309 
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Arg Arg Ala Tyr Leu Asp Gly Ser Gly Ala Gin Thr Leu Val Asn Thr 

400 405 410 

gag ate aac gac ccc gat ggc ate gcg gtc gac tgg gtg gee cga aac 1357 
Glu He Asn Asp Pro Asp Gly lie Ala Val Asp Trp Val Ala Arg Asn 

415 420 425 

etc tac tgg acc gac acg ggc acg gac cgc ate gag gtg acg cgc etc 1405 
Leu Tyr Trp Thr Asp Thr Gly Thr Asp Arg He Glu Val Thr Arg Leu 
430 435 440 445 

aac ggc acc tec cgc aag ate ctg gtg teg gag gac ctg gac gag ccc 1453 
Asn Gly Thr Ser Arg Lys lie Leu Val Ser Glu Asp Leu Asp Glu Pro 

450 455 460 

cga gee ate gca ctg cac ccc gtg atg ggc etc atg tac tgg aca gac 1501 
Arg Ala He Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp 

465 470 475 

tgg gga gag aac cct aaa ate gag tgt gee aac ttg gat ggg cag gag . 1549 
Trp Gly Glu Asn Pro Lys lie Glu Cys Ala Asn Leu Asp Gly Gin Glu 

480 485 490 

egg cgt gtg ctg gtc aat gee tec etc ggg tgg ccc aac ggc ctg gee 1597 
Arg Arg Val Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala 

495 500 505 

ctg gac ctg cag gag ggg aag etc tac tgg gga gac gee aag aca gac 1645 
Leu Asp Leu Gin Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp 
510 515 520 525 
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aag ate gag gtg ate aat gtt gat ggg acg aag agg egg acc etc ctg 1693 
Lys He Glu Val lie Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu 

530 535 540 

gag gac aag etc ccg cac att ttc ggg ttc acg ctg ctg ggg gac ttc 1741 
Glu Asp Lys Leu Pro His He Phe Gly Phe Thr Leu Leu Gly Asp Phe 

545 550 555 

ate tac tgg act gac tgg cag cgc cgc age ate gag egg gtg cac aag 1789 
lie Tyr Trp Thr Asp Trp Gin Arg Arg Ser lie Glu Arg Val His Lys 

560 565 570 

gtc aag gee age egg gac gtc ate att gac eag ctg ccc gac ctg atg 1837 
Val Lys Ala Ser Arg Asp Val lie lie Asp Gin Leu Pro Asp Leu Met 

575 580 585 

ggg etc aaa get gtg aat gtg gec aag gtc gtc gga acc aac ccg tgt 1885 
Gly Leu Lys Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys 
590 595 600 605 

gcg gac agg aac ggg ggg tgc age cac ctg tgc ttc ttc aca ccc cac 1933 
Ala Asp Arg Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His 

610 615 620 

gca acc egg tgt ggc tgc ccc ate ggc ctg gag ctg ctg agt gac atg 1981 
Ala Thr Arg Cys Gly Cys Pro lie Gly Leu Glu Leu Leu Ser Asp Met 

625 630. 635 

aag acc tgc ate gtg cct gag gec ttc ttg gtc ttc acc age aga gee 2029 
Lys Thr Cys lie Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala 
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640 645 650 

gcc ate cac agg ate tec etc gag ace aat aac aac gac gtg gee ate 2077 
Ala He His Arg lie Ser Leu Glu Thr Asn Asa Asn Asp Val Ala lie 

655 660 665 

ccg etc acg ggc gtc aag gag gcc tea gcc ctg gac ttt gat gtg tec 2125 
Pro Leu Thr Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser 
670 675 680 685 

aac aac cac ate tac tgg aca gac gtc age ctg aag acc ate age cgc 2173 
Asn Asn His lie Tyr Trp Thr Asp Val Ser Leu Lys Thr lie Ser Arg 

690 695 700 

gcc ttc atg aac ggg age teg gtg gag cac gtg gtg gag ttt ggc ctt 2221 
Ala Phe Met Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu 

705 710 715 

gac tac ccc gag ggc atg gcc gtt gac tgg atg ggc aag aac etc tac 2269 
Asp Tyr Pro Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr 

720 725 730 

tgg gcc gac act ggg acc aac aga ate gaa .gtg gcg egg ctg gac ggg 23 17 
Trp Ala Asp Thr Gly Thr Asn Arg lie Glu Val Ala Arg Leu Asp Gly 

735 740 745 

cag ttc egg caa gtc etc gtg tgg agg gac ttg gac aac ccg agg teg 2365 
Gin Phe Arg Gin Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser 
750 755 760 765 

ctg gcc ctg gat ccc acc aag ggc tac ate tac tgg acc gag tgg ggc 2413 
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Leu Ala Leu Asp Pro Thr Lys Gly Tyr He Tyr Trp Thr Glu Trp Gly 

770 775 780 

ggc aag ccg agg ate gtg egg gec ttc atg gac ggg acc aac tgc atg 2461 
Gly Lys Pro Arg He Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met 

785 790 795 

acg ctg gtg gac aag gtg ggc egg gec aac gac etc acc att gac tac 2509 
Thr Leu Val Asp Lys Val Gly Arg Ala Asn Asp Leu Thr He Asp Tyr 

800 805 810 

get gac cag cgc etc tac tgg acc gac ctg gac acc aac atg ate gag 2557 
Ala Asp Gin Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asu Met He Glu 

815 820 825 

teg tec aac atg ctg ggt cag gag egg gtc gtg att gec gac gat etc 2605 
Ser Ser Asn Met Leu Gly Gin Glu Arg Val Val He Ala Asp Asp Leu 
830 835 840 845 

ccg cac ccg ttc ggt ctg acg cag tac age gat tat ate tac tgg aca 2653 

Pro His Pro Phe Gly Leu Thr Gin Tyr Ser Asp Tyr He Tyr Trp Thr 

850 855 860 

gac tgg aat ctg cac age att gag egg gec gac aag act age ggc egg 2701 
Asp Trp Asn Leu His Ser He Glu Arg Ala Asp Lys Thr Ser Gly Arg 

865 870 875 

aac cgc acc etc ate cag ggc cac ctg gac ttc gtg atg gac ate ctg 2749 
Asn Arg Thr Leu He Gin Gly His Leu Asp Phe Val Met Asp He Leu 
880 885 890 
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gtg ttc cac tec tec cgc cag gat ggc etc aat gac tgt atg cac aac 2797 
Val Phe His Ser Ser Arg Gin Asp Gly Leu Asn Asp Cys Met His Asn 

895 900 • 905 

aac ggg cag tgt ggg cag ctg tgc ctt gee ate ecc ggc ggc cac cgc 2845 
Asn Gly Gin Cys Gly Gin Leu Cys Leu Ala lie Pro Gly Gly His Arg 
910 915 920 925 

tgc ggc tgc gee tea cac tac ace ctg gac ccc age age cgc aac tgc 2893 
Cys Gly Cys Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys 

930 935 940 

age ccg ccc ace acc ttc ttg ctg ttc age cag aaa tct gee ate agt 2941 
Ser Pro Pro Thr Thr Phe Leu Leu Phe Ser Gin Lys Ser Ala lie Ser 

945 950 955 

egg atg ate ccg gac gac cag cac age ccg gat etc ate ctg ccc ctg 2989 
Arg Met He Pro Asp Asp Gin His Ser Pro Asp Leu He Leu Pro Leu 

960 965 970 

cat gga ctg agg aac gtc aaa gee ate gac tat gac cca ctg gap aag 3037 
His Gly Leu Arg Asn Val Lys Ala He Asp Tyr Asp Pro Leu Asp Lys 

975 980 985 

ttc ate tac tgg gtg gat ggg cgc cag aac ate aag cga gee aag gac 3085 
Phe He Tyr Trp Val Asp Gly Arg Gin Asn He Lys Arg Ala Lys Asp 
990 995 1000 1005 

gac ggg acc cag ccc ttt gtt ttg acc tct ctg age caa ggc caa aac 3 133 
Asp Gly Thr Gin Pro Phe Val Leu Thr Ser Leu Ser Gin Gly Gin Asn 
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1010 1015 1020 

cca gac agg cag ccc cac gac etc age ate gae ate tac age egg aea 3181 
Pro Asp Arg Gin Pro His Asp Leu Set He Asp He Tyr Ser Arg Thr 

1025 1030 1035 

etg ttc tgg aeg tgc gag gee aee aat aee ate aae gtc eao agg etg 3229 
Leu Phe Trp Tar Cys Glu Ala Thr Asn Thr He Asn Yal His Arg Leu 

1040 1045 1050 

age ggg gaa gec atg ggg gtg gtg etg egt ggg gae cgc gae aag eec 3277 
Ser Gly Glu' Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro 

1055 1060 1065 

agg gee ate gtc gtc aae gcg gag cga ggg tac etg tac ttc aee aac 3325 
Arg Ala He Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asb 
1070 ' 1075 1080 1085 

atg cag gac egg gea gee aag ate gaa cgc gca gec etg gae gge ace 3373 
Met Gin Asp Arg Ala Ala Lys He Glu Arg Ala Ala Leu Asp Gly Thr 

1090 1095 HOO 

gag cgc gag gtc etc ttc ace aee gge etc ate cgc cct gtg gec etg 3421 

Glu Arg Glu Val Leu Phe Thr Thr Gly Leu He Arg Pro Val Ala Leu 

u0 5 H10 1115 

gtg gtg gac aac aca etg gge aag etg ttc tgg gtg gac gcg gae etg 3469 
Val Val Asp Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu 

1120 H25 H30 

aag cgc att gag age tgt gac etg tea ggg gec aac cgc etg aee etg 3517 
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Lys Arg lie Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu 

1135 1140 1145 

gag gac gcc aac ate gtg cag cct ctg ggc ctg acc ate ctt ggc aag 3565 
Glu Asp Ala Asn lie Val Gin Pro Leu Gly Leu Thr lie Leu Gly Lys 
1150 1155 1160 1165 

cat etc tac tgg ate gac cgc cag cag cag atg ate gag cgt gtg gag 3613 
His Leu Tyr Trp He Asp Arg Gin Gin Gin Met lie Glu Arg Val Glu 

1170 1175 1180 

aag acc acc ggg gac aag egg act cgc ate cag ggc cgt gtc gcc cac 3661 
Lys Thr Thr Gly Asp Lys Arg Thr Arg lie Gin Gly Arg Val Ala His 

1185 1190 1195 

etc act ggc ate cat gca gtg gag gaa gtc age ctg gag gag ttc tea 3709 
Leu Thr Gly lie His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser 

1200 1205 1210 

gcc cac cca tgt gcc cgt gac aat ggt ggc tgc tec cac ate tgt att 3757 
Ala His Pro Cys Ala Arg Asp Asn Gly Gly Cys Ser His lie Cys Be 

1215 1220 1225 

gcc aag ggt gat ggg aca cca egg tgc tea tgc cca gtc cac etc gtg 3805 
Ala Lys Gly Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val 
1230 1235 1240 1245 

etc ctg cag aac ctg ctg acc tgt gga gag ccg 'ccc acc tgc tec ccg 3853 
Leu Leu Gin Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro 
1250 1255 1260 
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gac cag ttt gca tgt gcc aca ggg gag ate gac tgt ate ece ggg gee 3901 
Asp Gin Pne Ala Cys Ala Tbx Gly Glu lie Asp Cys He Pro Gly Ala 

1265 1270 1275 

tgg cge tgt gae ggc ttt cee gag tgc gat gae eag age gac gag gag 3949 
Tip Arg Cys Asp Gly Phe Pro Glu Cys Asp Asp Gin Ser Asp Glu Glu 

1280 1285 1290 

ggc tgc ccc gtg tgc tec gcc gee cag ttc cee tgc gcg egg ggt cag 3997 
Gly Cys Pro Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin 

12 95 1300 1305 

tgt gtg gae etg cge etg ege tgc gae ggc gag gca gae tgt eag gac 4045 
Cys Val Asp Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gin Asp 
13 10 1315 1320 1325 

cge tea gae gag gtg gae tgt gae gee ate tgc etg ecc aac eag ttc 4093 
Arg Ser Asp Glu Val Asp Cys Asp Ala He Cys Leu Pro Asn Gin Phe 

1330 1335 1340 

egg tgt geg age ggc cag tgt gte etc ate aaa eag eag tgc gac tee 4141 
Arg Cys Ala Ser Gly Gin Cys Val Leu He Lys Gin Gin Cys Asp Ser 

1345 1350 1355 

ttc cee gac tgt ate gac ggc tec gae gag etc atg tgt gaa ate ace 4189 
. Phe Pro Asp Cys lie Asp- Gly Ser Asp Glu Leu Met Cys Glu lie Thr 
1360 1365 1370 

aag eeg ccc tea gae gac age ccg gee eae age agt gee ate ggg ecc 4237 
Lys Pro Pro Ser Asp Asp Ser Pro Ala His Ser Ser Ala He Gly Pro 
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1375 1380 1385 

gtc att ggc ate ate etc tct etc ttc gtc atg ggt ggt gtc tat ttt 4285 
Val lie Gly Be He Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe 
1390 1395 1400 1405 

gtg tgc cag cgc gtg gtg tgc cag cgc tat gcg ggg gec aac ggg ccc 4333 
Val Cys Gin Arg Val Val Cys Gin Arg Tyr Ala Gly Ala Asn Gly Pro 

1410 1415 1420 

ttc ccg cac gag tat gtc age ggg acc ccg cac gtg ccc etc aat ttc 4381 
Phe Pro His Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe 

1425 1430 1435 

ata gec ccg ggc ggt tec cag cat ggc ccc ttc aca ggc ate gca tgc 4429 
He Ala Pro Gly Gly Ser Gin His Gly Pro Phe Thr Gly He Ala Cys 

1440 1445 1450 

gga aag tec atg atg age tec gtg age ctg atg ggg ggc egg ggc ggg 4477 
Gly Lys Ser Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly 

1455 1460 1465 

gtg ccc etc tac gac egg aac cac gtc aca ggg gec teg tec age age 4525 
Val Pro Leu Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser 
1470 1475 1480 1485 

teg tec age acg aag gec acg ctg tac ccg ccg ate ctg aac ccg ccg 4573 
Ser Ser Ser Thr Lys Ala Thr Leu Tyr Pro Pro He Leu Asn Pro Pro 

1490 1495 1500 

ccc tec ccg gee acg gac ccc tec ctg tac aac atg gac atg ttc tac 4621 



BNSDOCID* <WO__01928S1A2J_> 



PCT/US01/16946 

WO 01/92891 

28 



Pro Ser Pro Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr 

1505 1510 1515 

tct tea aac att ccg gee aet geg aga ceg tac agg cee tac ate att 4669 
Ser Ser Asn lie Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr He He 

1520 1525 1530 

ega gga atg gcg ece ceg aeg acg eee tgc age acc gae gtg tgt gae 4717 
Arg Gly Met Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp 

1535 1540 1545 

age gae tac age gee age cgc tgg aag gee age aag tac tac etg gat 4765 
Ser Asp Tyr Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp 
1550 1555 1560 1565 

ttg aac teg gae tea gae cec tat cca cee cea eee aeg ccc cac age 4813 
Leu Asn Ser Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser 

15 70 1575 1580 

eag tac etg teg geg gag gae age tgc ceg eee teg cee gee aee gag 4861 
Gin Tyr Leu Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu 

1585 1590 1595 

agg age tac ttc cat etc ttc ccg ccc cct ccg tec ccc tgc acg gae 4909 
Arg Ser Tyr Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp 

1600 1605 1610 

tea tec tgacctcggc cgggccactc tggcttctct gtgcccctgt aaatagtttt 4965 

Ser Ser 
1615 
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aaatatgaac aaagaaaaaa atatatttta tgatttaaaa aataaatata attgggattt 5025 
taaaaacatg agaaatgtga actgtgatgg ggtgggcagg gctgggagaa ctttgtacag 5085 
tggagaaata tttataaact taattttgta aaaca 5 120 

<210> 3 

<211> 1615 

<212> PRT 

< 213 > Homo sapiens 

<400 > 3 

Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala Ala Ala Ser 

20 25 30 

Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala 

35 40 45 

Gly Gly Val Lys Leu Glu Ser Thr lie Val Val Ser Gly Leu Glu Asp 

50 55 60 

Ala Ala Ala Val Asp Phe Glu Phe Ser Lys Gly Ala Val Tyr Trp Thr 
65 70 75 80 

Asp Val Ser Glu Glu Ala He Lys Gin Thr Tyr Leu Asn Gin Thr Gly 

85 90 95 

Ala Ala Val Gin Asn Val Val lie Ser Gly Leu Val Ser Pro Asp Gly 
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100 105 110 

Leu Ala Cys Asp Tip Val Gly Lys Lys Leu Tyr Trp Thr Asp Ser Glu . 

115 120 125 

Thr Asn Arg He Glu Val Ala Asn Leu Asn Gly Thr Ser Arg Lys Val 

130 135 140 

Leu Phe Trp Gin Asp Leu Asp Gin Pro Lys Ala lie Ala Leu Asp Pro 
145 150 155 160 

Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Gly Glu Thr Pro Arg He 

165 170 175 

Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys lie He Val Asp Ser 

180 185 190 

Asp lie Tyr Trp Pro Asn Gly Leu Thr lie Asp Leu Glu Glu Gin Lys 

195 200 205 

Leu Tyr Tip Ala Asp Ala Lys Leu Ser Phe He His Arg Ala Asn Leu 

210 215 220 

Asp Gly Ser Phe Arg Gin Lys Val Val Glu Gly Ser Leu Thr His Pro 
225 230 235 240 

Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr Asp Trp Gin 

245 250 255 

Thr' Arg Ser He His Ala Cys Asn Lys Arg Thr Gly Gly Lys Arg Lys 

260 265 270 

Glu He Leu Ser Ala Leu Tyr Ser Pro Met Asp He Gin Val Leu Ser 
275 280 285 
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Gin Glu Arg Gin Pro Phe Phe His Thr Arg Cys Ghi Glu Asp Asn Gly 

290 295 300 

Gly Trp Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro Phe Tyr Thr 
305 310 315 320 

Cys Ala Cys Pro Thr Gly Val Gin Met Gin Asp Asn Gly Arg Thr Cys 

325 330 335 

Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg Thr Asp Leu 

340 345 350 

Arg Arg lie Ser Leu Asp Thr Pro Asp Phe Thr Asp lie Val Leu Gin 

355 360 365 

Val Asp Asp lie Arg His Ala lie Ala He Asp Tyr Asp Pro Leu Glu 

370 375 380 

Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala lie Arg Arg Ala 
385 390 395 400 

Tyr Leu Asp Gly Ser Gly Ala Gin Thr Leu Val Asn Thr Glu lie Asn 

405 410 415 

Asp Pro Asp Gly He Ala Val Asp Trp Val Ala Arg Asn Leu Tyr Trp 

420 425 430 

Thr Asp Thr Gly Thr Asp Arg lie Glu Val Thr Arg Leu Asn Gly Thr 

435 440 445 

Ser Arg Lys lie Leu Val Ser Glu Asp Leu Asp Glu Pro Arg Ala lie 

450 455 460 

Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp Trp Gly Glu 
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465 470 475 480 

Asa Pro Lys He Glu Cys Ala Asn Leu Asp Gly Gin Glu Arg Arg Val 

485 490 495 

Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala Leu Asp Leu 

500 505 510 

Gin Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp Lys lie Glu 

515 520 525 

Val He Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu Glu Asp Lys 

530 535 540 

Leu Pro His lie Phe Gly Phe Thr Leu Leu Gly Asp Phe lie Tyr Trp 
545 550 555 560 

Thr Asp Trp Gin Arg Arg Ser lie Glu Arg Val His Lys Val Lys Ala 

565 570 575 

Ser Arg Asp Val lie lie Asp Gin Leu Pro Asp Leu Met Gly Leu Lys 

580 585 590 

Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys Ala Asp Arg 

595 600 605 

Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His Ala Thr Arg 

610 615 620 

Cys Gly Cys Pro lie Gly Leu Glu Leu Leu Ser Asp Met Lys Thr Cys 
625 630 635 640 

He Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala Ala lie His 
645 650 655 
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Arg lie Ser Leu Glu Thr Asn Asn Asn Asp Val Ala lie Pro Leu Thr 

660 665 670 

Gly Val Lys Glu Ala Ser Ala Leu Asp Pile Asp Val Ser Asn Asn His 

675 680 685 

lie Tyr Trp Thr Asp Val Ser Leu Lys Asn lie Ser Arg Ala Phe Met 

690 695 700 , 

Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu Asp Tyr Pro 
705 710 715 720 

Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr Trp Ala Asp 

725 730 735 

Thr Gly Thr Asn Arg He Glu Val Ala Arg Leu Asp Gly Gin Phe Arg 

740 745 750 

Gin Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser Leu Ala Leu 

755 760 765 

Asp Pro Thr Lys Gly Tyr lie Tyr Trp Thr Glu Trp Gly Gly Lys Pro 

770 775 780 

Arg lie Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met Thr Leu Val 
785 790 795 800 

Asp Lys Val Gly Arg Ala Asn Asp Leu Thr lie Asp Tyr Ala Asp Gin 

805 810 815 

Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Met lie Glu Ser Ser Asn 

820 825 830 

Met Leu Gly Gin Glu Arg Val Val lie Ala Asp Asp Leu Pro His Pro 
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835 840 845 

Phe Gly Leu Thr Gin Tyr Ser Asp Tyr He Tyr Trp Thr Asp Tip Asn 

S50 855 860 

Leu His Ser Be Glu Arg Ala Asp Lys Thr Ser Gly Arg Asn Arg Thr 
865 870 875 880 

Leu He Gin Gly His Leu Asp Phe Val Met Asp He Leu Val Phe His 

885 890 895 

Ser Ser Arg Gin Asp Gly Leu Asn Asp Cys Met His Asn Asn Gly Gin 

900 905 910 

Cys Gly Gin Leu Cys Leu Ala He Pro Gly Gly His Arg Cys Gly Cys 

915 920 925 

Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys Ser Pro Pro 

930 935 940 

Thr Thr Phe Leu Leu Phe Ser Gin Lys Ser Ala He Ser Arg Met He 
945 950 955 960 

Pro Asp Asp Gin His Ser Pro Asp Leu He Leu Pro Leu His Gly Leu 

965 970 975 

Arg Asn Val Lys Ala He Asp Tyr Asp Pro Leu Asp Lys Phe He Tyr 

980 985 990 

Trp Val Asp Gly Arg Gin Asn He Lys Arg Ala Lys Asp Asp Gly Thr 

995 1000 1005 

Gin Pro Phe Val Leu Thr Ser Leu Ser Gin Gly Gin Asn Pro Asp Arg 
1010 1015 1020 
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Gin Pro His Asp Leu Ser lie Asp He Tyr Ser Arg Thr Leu Phe Trp 

1025 1030 1035 1040 

Thr Cys Glu Ala Thr Asn Thr Be Asn Val His Arg Leu Ser Gly Glu 

1045 1050 1055 

Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro Arg Ala He 

1060 1065 1070 

Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met Gin Asp 

1075 1080 1085 

Arg Ala Ala Lys He Glu Arg Ala Ala Leu Asp Gly Thr Glu Arg Glu 

', 1090 1095 1100 

Val Leu Phe Thr Thr Gly Leu He Arg Pro Val Ala Leu Val Val Asp 
1105 1110 1115 1120 

Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg He 

1125 1130 1135 

Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala 

1140 1145 1150 

Asn He Val Gin Pro Leu Gly Leu Thr He Leu Gly Lys His Leu Tyr 

1155 1160 1165 

Trp He Asp Arg Gin Gin Gin Met He Glu Arg Val Glu Lys Thr Thr 

1170 1175 1180 

Gly Asp Lys Arg Thr Arg He Gin Gly Arg Val Ala His Leu Thr Gly 
1185 1190 1195 1200 

Tie His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro 
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1205 1210 1215 

Cys Ala Arg Asp Asn Gly Gly Cys Ser His He Cys He Ala Lys Gly 

1220 1225 1230 

Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gin 

1235 1240 1245 

Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gin Phe 

1250 1255 1260 

Ala Cys Ala Thr Gly Glu He Asp Cys He Pro Gly Ala Trp Arg Cys 
1265 1270 1275 1280 

Asp Gly Phe Pro Glu Cys Asp Asp Gin Ser Asp Glu Glu Gly Cys Pro 

1285 1290 1295 

Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin Cys Val Asp 

1300 1305 1310 

Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gin Asp Arg Ser Asp 

1315 1320 1325 

Glu Val Asp Cys Asp Ala He Cys Leu Pro Asn Gin Phe Arg Cys Ala 

1330 1335 1340 

Ser Gly Gin Cys Val Leu He Lys Gin Gin Cys Asp Ser Phe Pro Asp 
1345 1350 1355 1360 

Cys He Asp Gly Ser Asp Glu Leu Met Cys Glu He Thr Lys Pro Pro 

1365 1370 1375 

Ser Asp Asp Ser Pro Ala His Ser Ser Ala He Gly Pro Val lie Gly 
1380 1385 1390 
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lie He Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gin 

1395 1400 1405 

Arg Val Val Cys Gin Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro His 

1410 1415 1420 

Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe lie Ala Pro 
1425 1430 1435 1440 

Gly Gly Ser Gin His Gly Pro Phe Thr Gly lie Ala Cys Gly Lys Ser 

1445 1450 1455 

Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu 

1460 1465 1470 

Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser 

1475 1480 1485 

Thr Lys Ala Thr Leu Tyr Pro Pro Be Leu Asn Pro Pro Pro Ser Pro 

1490 1495 1500 

Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn 
1505 1510 1515 1520 

lie Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr lie De Arg Gly Met 

1525 1530 1535 

Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr 

1540 1545 1550 

Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser 

1555 . 1560 1565 

Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gin Tyr Leu 
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1570 1575 1580 

Set Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr 
1585 1590 1595 1600 

Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser 
1605 1610 1615 



<210> 4 
<211> 1615 
<212 > PRT 
<213> Homo sapiens 



<400 > 4 

Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu Leu Leu Leu 
1 5 10 I 5 

Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala Ala Ala Ser 

20 25 30 

Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala 

35 40 45 

Gly Gly Val Lys Leu Glu Ser Thr lie Val Val Ser Gly Leu Glu Asp 

50 55 60 

Ala Ala Ala Val Asp Phe Glu Phe Ser Lys Gly Ala Val Tyr Trp Thr 
65 70 75 80 
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Asp Val Ser Glu Glu Ala lie Lys Gin Thr Tyr Leu Asn Gin Thr Gly 

85 90 95 

Ala Ala Val Gin Asn Val Val He Ser Gly Leu Val Ser Pro Asp Gly 

100 105 110 

Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr Asp Ser Glu 

115 120 125 

Thr Asn Arg lie Glu Val Ala Asn Leu Asn Gly Thr Ser Arg Lys Val 

130 135 140 

Leu Phe Trp Gin Asp Leu Asp Gin Pro Lys Ala lie Ala Leu Asp Pro 
145 150 155 160 

Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Val Glu Thr Pro Arg lie 

165 170 175 

Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys lie lie Val Asp Ser 

180 185 190 

Asp lie Tyr Trp Pro Asn Gly Leu Thr lie Asp Leu Glu Glu Gin Lys 

195 200 205 

Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe lie. His Arg Ala Asn Leu 

210 215 220 

Asp Gly Ser Phe Arg Gin Lys Val Val Glu Gly Ser Leu Thr His Pro 
225 230 235 240 

Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr Asp Trp Gin 

245 250 255 

Thr Are Ser lie His Ala Cys Asn Lys Arg Thr Gly Gly Lys Arg Lys 

SNSOOCID: <WO 0192B91A2_L> 



, „„ PCT/TJS01/16946 
WO 01/92891 

40 



260 265 270 

Glu ne Leu Ser Ala Leu Tyr Ser Pro Met Asp He Gin Val Leu Ser 

275 280 285 

Gin Glu Arg Gin Pro Phe Phe His Thr Arg Cys Glu Glu Asp Asn Gly 

290 295 300 

Gly Trp Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro Phe Tyr -Thr 
305 310 315 320 

Cys Ala Cys Pro Thr Gly Val Gin Met Gin Asp Asn Gly Arg Thr Cys 

325 330 335 

Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg Thr Asp Leu 

340 345 350 

Arg Arg lie Ser Leu Asp Thr Pro Asp Phe Thr Asp lie Val Leu Gin 

355 360 365 

Val Asp Asp lie Arg His Ala He Ala lie Asp Tyr Asp Pro Leu Glu 

370 375 380 

Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala ne Arg Arg Ala 
385 390 395 400 

Tyr Leu Asp Gly Ser Gly Ala Gin Thr Leu Val Asn Thr Glu lie Asn 

405 410 415 

Asp Pro Asp Gly He Ala Val Asp Trp Val Ala Arg Asn Leu Tyr Trp 

420 425 430 

Thr Asp Thr Gly Thr Asp Arg lie Glu Val Thr Arg Leu Asn Gly Thr 
435 440 445 
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Ser Arg Lys lie Leu Val Ser Glu Asp Leu Asp Glu Pro Arg Ala He 

450 455 460 

Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp Trp Gly Glu 
465 470 475 480 

Asn Pro Lys He Glu Cys Ala Asn Leu Asp Gly Gin Glu Arg Arg Val 

485 490 495 

Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala Leu Asp Leu 

500 505 510 

Gin Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp Lys He Glu 

515 520 525 

Val lie Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu Glu Asp Lys 

530 535 540 

Leu Pro His lie Phe Gly Phe Thr Leu Leu Gly Asp Phe lie Tyr Trp 
545 550 555 560 

Thr Asp Trp Gin Arg Arg Ser He Glu Arg Val His Lys Val Lys Ala 

565 570 575 

Ser Arg Asp Val He lie Asp Gin Leu Pro Asp Leu Met Gly Leu Lys 

580 585 590 

Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys Ala Asp Arg 

595 600 605 

Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His Ala Thr Arg 

610 615 620 

Cys Gly Cys Pro lie Gly Leu Glu Leu Leu Ser Asp Met Lys Thr Cys 
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625 630 635 640 

He Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala Ala He His 

645 650 655 

Arg He Ser Leu Glu Thr Asn Asn Asn Asp Val Ala He Pro Leu Thr 

660 665 670 

Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser Asn Asn His 

675 680 685 

He Tyr Trp Thr Asp Val Ser Leu Lys Asn He Ser Arg Ala Phe Met 

690 695 700 

Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu Asp Tyr Pro 
705 710 715 720 

Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr Trp Ala Asp 

725 730 735 

Thr Gly Thr Asn Arg He Glu Val Ala Arg Leu Asp Gly Gin Phe Arg 

740 745 750 

Gin Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser Leu Ala Leu 

755 760 765 

Asp Pro Thr Lys Gly Tyr He Tyr Trp Thr Glu Trp Gly Gly Lys Pro 

770 775 780 

Arg He Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met Thr Leu Val 
785 790 795 800 

Asp Lys Val Gly Arg Ala Asn Asp Leu Thr He Asp Tyr Ala Asp Gin 
805 810 815 
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Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Met lie Glu Ser Ser Asn 

820 825 830 

Met Leu Gly Gin Glu Arg Val Val He Ala Asp Asp Leu Pro His Pro 

835 840 845 

Phe Gly Leu Thr Gin Tyr Ser Asp Tyr He Tyr Trp Thr Asp Trp Asn 

850 855 860 

Leu His Ser lie Glu Arg Ala Asp Lys Thr Ser Gly Arg Asn Arg Thr 
865 870 875 880 

Leu He Gin Gly His Leu Asp Phe Val Met Asp He Leu Val Phe His 

885 890 895 

Ser Ser Arg Gin Asp Gly Leu Asn Asp Cys Met His Asn Asn Gly Gin 

900 905 910 

Cys Gly Gin Leu Cys Leu Ala He Pro Gly Gly His Arg Cys Gly Cys 

915 920 925 

Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys Ser Pro Pro 

930 935 940 

Thr Thr Phe Leu Leu Phe Ser Gin Lys Ser Ala He Ser Arg Met lie 
945 950 955 960 

Pro Asp Asp Gin His Ser Pro Asp Leu lie Leu Pro Leu His Gly Leu 

965 970 975 

Arg Asn Val Lys Ala lie Asp Tyr Asp Pro Leu Asp Lys Phe He Tyr 

980 985 990 

Trp Val Asp Gly Arg Gin Asn He Lys Arg Ala Lys Asp Asp Gly Thr 
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995 1000 1005 

Gin Pro Phe Val Leu Thr Ser Leu Ser Gin Gly Gin Asn Pro Asp Arg 

1010 1015 1020 

Gin Pro His Asp Leu Ser lie Asp lie Tyr Ser Arg Thr Leu Phe Trp 
1025 1030 1035 1040 

Thr Cys Glu Ala Thr Asn Thr He Asn Val His Arg Leu Ser Gly GIu 

1045 1050 1055 

Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro Arg Ala lie 

1060 1065 1070 

Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met Gin Asp 

1075 1080 1085 

Arg Ala Ala Lys lie Glu Arg Ala Ala Leu Asp Gly Thr Glu Arg Glu 

1090 1095 1100 

Val Leu Phe Thr Thr Gly Leu lie Arg Pro Val Ala Leu Val Val Asp 
1105 1110 1115 1120 

Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg lie 

1125 1130 1135 

Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala 

1140 1145 1150 

Asn He Val Gin Pro Leu Gly Leu Thr lie Leu Gly Lys His Leu Tyr 

1155 1160 1165 

Trp lie Asp Arg Gin Gin Gin Met lie Glu Arg Val Glu Lys Thr Thr 
1170 1175 1180 
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Gly Asp Lys Arg Thr Arg lie Gin Gly Arg Val Ala His Leu Thr Gly 

1185 1190 1195 1200 

He His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro 

1205 - 1210 1215 

Cys Ala Arg Asp Asn Gly Gly Cys Ser His Be Cys lie Ala Lys Gly 

1220 1225 1230 

Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gin 

1235 1240 1245 

Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gin Phe 

1250 1255 1260 

Ala Cys Ala Thr Gly Glu He Asp Cys lie Pro Gly Ala Trp Arg Cys 
1265 1270 1275 1280 

Asp Gly Phe Pro Glu Cys Asp Asp Gin Ser Asp Glu Glu Gly Cys Pro 

1285 1290 1295 

Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin Cys Val Asp 

1300 1305 1310 

Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gin Asp Arg Ser Asp 

1315 1320 1325 

Glu Val Asp Cys Asp Ala lie Cys Leu Pro Asn Gin Phe Arg Cys Ala 

1330 1335 1340 

Ser Gly Gin Cys Val Leu lie Lys Gin Gin Cys Asp Ser Phe Pro Asp 
1345 1350 1355 1360 

Cvs lie Asp Gly Ser Asp Glu Leu Met Cys Glu He Thr Lys Pro Pro 
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1365 1370 1375 

Ser Asp Asp Ser Pro Ala His Ser Ser Ala He Gly Pro Val lie Gly 

1380 1385 1390 

lie lie Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gin 

1395 1400 1405 

Arg Val Val Cys Gin Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro His 

1410 1415 1420 

Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe He Ala Pro 
1425 1430 1435 1440 

Gly Gly Ser Gin His Gly Pro Phe Thr Gly lie Ala Cys Gly Lys Ser 

1445 1450 1455 

Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu 

1460 1465 1470 

Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser 

1475 1480 1485 

Thr Lys Ala Thr Leu Tyr Pro Pro lie Leu Asn Pro Pro Pro Ser Pro 

1490 1495 1500 

Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn 
1505 1510 1515 1520 

He Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr lie lie Arg Gly Met 

1525 1530 1535 

Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr 
1540 1545 1550 
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Ser Ala Ser Arg Tip Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser 

1555 1560 1565 

Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gin Tyr Leu 

1570 1575 1580 

Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr 
1585 1590 1595 1600 

Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser 
1605 1610 1615 



<210> 5 

<211> 3096 

<212> DNA 

< 213 > Homo sapiens 

<400 > 5 

catcttctca cacgatctct cgcttcgcac tccttccttt gattggtttt caccatttac 60 
tcagacgacg gtccttcttc gatctttgca cattcttcta tcatctacta ccttcatacc 120 
cagctccgtc ccctaatatt catgcgcgga tggcccattc cgtggtgaaa attcccttct 180 
actctgctaa tctgctgttc tctctccctc ccgtcgggtt ctgctcctgc cacgttctcc 240 
cctctcccca ccaaaggctg ggttttcttt gtcagggctc ctttcccctt tggaagaagg 300 
ggggctgtat ggccttggtg cgaggccctc cagtgacagg atcccccatc acccagagtt 360 
ccacaggccc tggtagggag gagggggagc agaagaggag gtgccatctt tgcctgctgg 420 
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ggaagggcag gggccaccca cacagagctc tcccatttgc tgtggaccct ggggccactg 480 
cccagttcct tccaaaggaa agccagctcc ccaggtggtg ggagagtgat atggcttcct 540 
cttaaactta gggaattgag tgtgtggttg cttctaagtg ccttagaagc cgggagcggc 600 
tcctggaaag agcctgcctg ccacagcggg ccttaccctg gctgtgccca cagatgtccc 660 
tggggcctgc cgctcctgcc cggctctcct ggcctccccc ggtgtgggtt gggaaaagca 720 
cagcaaatta aaaaacacct ccatctctgg cctttgaaga atgcatctga acagccgaga 780 
gtgtaaaccg tggtgaaatg tggtctttcc agtttgggga gaagcagggc agagctgggg 840 
cttttgtacc cagggtttcc aagagctcct gcctccctcg gctgggctgg ccagggcccc 900 
ccgctgggac ctccagctgt aatagggaag gttttactgg gttgctggcc actgtggact 960 
gcccctaagg gcaggtatgc ctgcctttac ccgggttccc ctcctgcctg gaagatacag 1020 
cccatgggag gcctgttgtc tgtgggatcc tccagcatca gagacactgg ggccagcgtc 1080 
tgcctggtga ggtgcaggcc tggcaggccc ggtcccccac ctgcttgagc acccacggtg 1140 
gtgggggctc gctgcctccc gagacaatct atgtcattgt tgtccaagga agctaattta 1200 
gagtagaaag ttccgtgtcc agtcccactc tgtgcgtgtg ttagcagggg actctcgggc 1260 
cggagctggg tccaccctgg tagggggact tcatggggcc tgggcgacag cactgtgtat 1320 
ttgtgtgtgt gtgtgtttgt gtgtgtgtgt gtctgaggag gtggaccagt ttctcaaaag 1380 
gcctgtgacc ccaagaacca aggaatttca gcctgggtgg atcacacctt cactggtgag 1440 
tgggacaagc tgggggccct cgccacagga gcagccaggg catggggcac agttggcctc 1500 
attcacaaaa tgggagtata agtgatccct gctctggcgg ccaggacgat gagtgggaac 1560 
acaccgtgtg ggggctgcct ggcctgggtg tgccgcgggt gtccttgttg gtgatggttc 1620 
cacctgcttg tgccaccagt gccctctggg tctcacacac aactctcttc ccagcgaagg 1680 
cccctcctgc cctcaggcct cagtgctgct tccgtctcgg aaggccccag gagctcctgc 1740 
atcctssgcs tgattcctgt gtgcctgcas accccctcgc ggctgccatc tcatcctttg 1800 
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gtgcacctgt tggccagacc tcctggtagc gggtgctgca ctcccctgaa tgtgccgggg 1860 
cctgggggca gggacctggg ctcctccctc actgagtgga gggaactcag tgtcttggag 1920 
ttggggtgcc tgcaggctgg gtggtgcagg tgaaatgcag acctctcagc tggtgttcca 1980 
gagcagctgc cttcccccgc ccgagggact tcacccgcag cccagtcagg ggtggcgcct 2040 
gggtgcatcg cccgcaggct gggtaggggt ggagcctggg tggccctgcc tgtgagctgc 2100 
atagttgtcg cctttgaccc tgagttttct tcgttatctg tttggacctg tttggggcag 2160 
gcaggggatg agatctgaag ataaatgcct tagctgtgac catctccttt tgtgagaggt 2220 
caatgtccag ttccgctgca gttataacat cccatttttt gatttctttt tattttttcc 2280 
tttttctttt tgagatggag tctcgctctg tcacccaggc tggagtgcaa tggggtgacc 2340 
tcagctcact gcaacctcca cttctcgggt tcaagtgatt ctcctgcctc agcctcctga 2400 
ctagcagggg ttacaggcgt gagccaccac gcccagctaa tttttgtatt tttagtagag 2460 
gcaaggtttc gtcatgttgg ccaggctggt ctcaaactcc tggccttaag tgatctgccc . 2520 
gcctcggcct cccaaagtgc tgagatgaca ggtgtgagcc accgtgcccg gcccagaact 2580 
ctttaattcc cacctgaaac ttgccgcctt aagcaggtcc ccagtctccc tcccctagtc 2640 
cctggtccca ccattctgct ttctgtctca atgaatttgc ctaccgtaag tacctcatat 2700 
aaattgaatc ataaagtatt tgtcttttta' tatctggctt atttcactta gcataacatt 2760 
cttaagtttc atccatgttg tagcatgtgt cagaatctct ctcttttttt tttttttttt 2820 
tttttttttt ttttgcagac agagtctcgc tctgtcatct agactggagt tcagtggcac 2880 
gatctcggtt cactgcaaca tctgcctcct gggtccaagc aattctcctg cctcagcctc 2940 
cttagcagct ggaactacag gcgcgtgcca ccatgccttg ctaatttttg tatttttatg 3000 
tggaggcagg gtttcaccat cttggccagg ctggtctcga attcctggtc ttcaccacgg 3060 
gggcccgaag gacccgggca aagcgtggag gggagg 3096 
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<210> 6 
<211> 26928 
<212> DNA 
< 213 > Homo sapiens 

<220> 
<221> unsure 

<222> (12044),(12489),(26433) ) (26434),(26435),(26436) ) (26439),(26441) 
<223 > Identity of nucleotide sequences at the above locations are unknown. 

<400> 6 

gaagaccaag ggcacacagc gaggcagttt cagggcgggc agcctggggc cccacggggc 60 
ggccccggac acttgttctc acctgtggag ggcagagaag ggaacaggga gagaagtggc 120 
cggctgggag tggaggtggg tttgaggttt tactgtaaac taaatgtgta ccctctacct 180 
tagttatgaa ttatgagaca cgaagactgc gaaacagaca cactcctcta aaagtgcctc 240 
taggctgaca gggagaaagt cccgccaggc tcccagacgc cacctttgag tccttcaaca 300 
agcccgccag ggcctcttgc ccaccggtgt cagctcagcc actgaaccct ccaggaagaa 360 
gacgtgctgg taggagaaga atctcaccca ggcacagcct ggaaggggca cagaaggggc 420 
tccggaacca gcaagcccaa gttggaactc ccagtctgct actttctaga acgactgtgc 480 
ccttggcggg tctaagtaga acctctccgc gcactctttc ctcctttgta aagtggggac 540 
agcaatggcc accttgcagg ttcagagagg gcttgcagta cctcacagaa ctgagtgccc 600 
gtgaacgtgt gtgttcctcc agatttgtga cagctttgcc aggctggagt caggctgaac 660 
gcctctgccc tcatggggtt tatattctag gaagaccaac aaaaacaaga agacggaaaa 720 
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ttaaaacaac aaaagcccca ttgacaggcc gtgaagaatg ccatgaaaaa tgaatggcgt 780 
tgtgctgcag tctttgggga aacgggctta cggaaagaag gacacttgag ctgctaccaa 840 
tgagcagccg tccggtggga gggcagttca ggaagagcag acatccactg aggaggcgct 900 
ggggcagagg gcagcctggt cgctggattc gggggaggaa ccacatcagg ccatgagctg 960 
gagctggtgg tagaatgtac aggagaggcc agccagggcc agctcatgtc agacctcaag 1020 
cggggaagat gaatcgagaa tgcaccccac gagcaatggg aagccagtct acgatttaag 1080 
cagcaaaaat attttccctt cttccaccct gcatccagct ctaccagcac agcctggggt 1 140 
tctattttca agatagaata gacccagact cccagctctt cttacacttc tactactgcc 1200 
acctgtcacc cactcatgcg tccccacttg cagcctcgac ccccttccac ctgatctcat 1260 
ggcagccagg gaagctccag ggctcgtgag ggctgccatc tcaggaaaga agcaaaagcc 1320 
ttcggcacct gcagggcctg ctccaaccac acttcttcct tgacctctca gcttccttag 1380 
ccactccctt cccacatctc accctgctcc agccacagtg gtgtctctgt gggttctcaa 1440 
acacaccagg tgcactcctg cctcagggcc tttgtgcttg ctgttctctg ctgggactct 1500 
tttttttttt tttttttttg agacagggtc tcactctgtg gcccaggctg gagtgtagtg 1560 
gtgtgatcgt agctcattgc aacctcaaac tcctgggctc aagcaatcct cccacctcag 1620 
cctctcaagt agttagcttt tgttgttttg ttttgagatg ggatctcact ctgttgccca 1680 
ggctggagtg cagtggggca atcttggctc accacaacct ctgcctccca ggctcaagca 1740 
attctcctgc ctcagcctcc caagtagctg ggattacagg catgtgccac cacgcccagc 1800 
ttatttttgt atttttagta gagacagggt ttcaccatgt tggtctggct ggtcttgaac 1860 
tcctggcctc agatgatcca cctgcctcgg cctcccaaag tgctgggatg acaggcatga 1920 
gcctgtctct agtagttagg actacagaga ggggccatca tgcctggtga tcctcccacc 1980 
ttttctgctc caactctttc accccactta gcctcgtggc tcactctctt acctcttcag 2040 
ctcctcagtc aggcctsagg acccctgttg aaaattgcaa accacacccc ccaccaccac 2100 
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cacccactat tgccagcact ttctactcca tttctctgct ttacttttct cctttgtact 2160 
catcaccacc tgactcatta catgtttacg tatctttctt ctctccacta gcatggaagc 2220 
tccaggagag cagagagtgt agttttalte cctgatgtgt ttcctgtgcc cgtaccaggg 2280 
cctagcacac agtaggtgct cagtaaatgt gtgttggatg aacaaataca gtgaaaggat 2340 
ctgatctaca tttataaaga aggcactctg gctgctgagt ggggatgaga ctgtcaggag 2400 
gaaagaggcc cctgtggggg cctggccagc aggtgggtac aatggtagca gccaggagag 2460 
agggcctctt ggactcaagt ggatggggcc tgctcagggc tccggccaca ggaacaaagg 2520 
gaagggggcc caggatggcc tgtcatagag gacacattac aactggccca aagttcaagt 2580 
caggtttcta aatttgggaa gggatacaga aaaactaaag actctactgg acagtcagtt 2640 
attgaaatga ttacatagaa aatgtaccaa gaattaaaaa aaaaaaaaaa aagcattatg 2700 
aaggggccac cagagactcc cagagaggaa agggactatg ggctggatgc ggtgactcac 2760 
acctataatc ccagcacttt gggaggccga ggagggtgga tcacgaggtc aggagttcaa 2820 
aaccagccta ggcaacatgg taaaaccccc gtttctacta aaaatacaaa aaattagctg 2880 
ggcatggcag catgtgcctg taatcccagc tactcgggag gctgaggcag gagagttgct 2940 
agaacccagg aggcagaggt tgcagtgagc cgagattgag ccactatgct ccagcttggg 3000 
cgacagagca agactccgtc tctaaaaaaa agaaaaaaaa ggccagatga ggtggctcat 3060 
gcctgtaatc ccagcacttt gggaggccga ggtgggtgga tcacgaggtc aggagatcga 3120 
gaccatcctg gctaacatgg tgaaactcca tctctactta aaatacaaaa aattagccgg 3180 
gcgtggtggc gggcacctgt agtcccagct acttgggagg ctgaggcagg agaatggcgt 3240 
gaacctggga ggcggagctt gcagtgagcc gagattgcgc cactgcactc catccagcct 3300 
gggcgacaga gttagactcc gtctcaaaaa aaaaaaaaaa aaaaaaatta gctgattagt 3360 
tgggcttggt ggcgggcgcc tgtaatccca actactcggg aggctgaggc gggagaatca 3420 
ctteaacccfc gfiasgcagag gttgcaatga gccgatatca cgccactaca ctccagcctg 3480 
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ggcgacagag caagactcca tctcaaaaaa gaaaaaaaaa aagaaagggg ctgtgctgtg 3540 
gcctgggacc caaagcacac tactgcaagg tcccagggtg cctgactcca accggagcct 3600 
tgagaacatt catttgcaaa gaatgaatta aaattcagca ctattttatt ctgcaggatt 3660 
ccagcacccc aaggacagtc atttttagac ccttcagtaa cgtaataagt aaccggagga 3720 
tgtgctgagc ttccacttcc ccagacggtt gcctgtcaca gctcatcagg ccaacaaact 3780 
tttcttaggc ctcaaatttg gaaatgttca ctctcagttc gttccttaga tgcaagtcca 3840 
tcccaatgaa gtaacagggg ctcagcacct gtccaatctc attgcttccg gggacagggg 3900 
cccatgagga tgtcgtttca gcccggtgac acttgggcaa agtgcctttt ggtttccctc 3960 
ccaggctgga acgtgctggc tctgtgaagt tacgctgggc acaagagccc cccccaaccc 4020 
ggcaggactg actgctgtgg tcagaggcgc ccctggggct ttgggagcca cagaatcttc 4080 
ctgagggcag cgccggagga ggccccagtg agagtgccca ctgccaggct cattcctcag 4140 
gctgccgcag gcctctcccc aaaacaggca atgcttctca gcaacctgcc ccaggagcag 4200 
gccagggaag gccgccatcg gcctacagtg ctgggctctg gagggcttgg ttggtaacag 4260 
gccatggttt ctatgagcca gctggggtgt gaaggacaca ggctggattc acctctctgg 4320 
gcctcagttt ctgcattcaa aaagtgggaa tcatgatatc tgctctattt cttatctctc 4380 
agtgctgatg tgaacctcca ataagacttt taaaaatact ctttctacct tacttttatt 4440 
tttcatttat tttaagataa tgtctagctg tctcacccag gctggagtgc agtggtgtga 4500 
ttacggctca ctacagcctt aacctcccag gctcaagtga tcctcctacc acagcctccc 4560 
aagtagctgg aactacaggc atgcaccacc gcacctggat aattttttct tttgagacaa 4620 
ggtttcactc tgttgcccag gctggagtgc agtggtgcac tcttggctca ctgcagcctc 4680 
aacctccctg ggcttaggtg atcctcacac ttcagtctcc caagtagctg ggactacagg 4740 
tatgtgccag tacacccagc taatattttt gaaggatggg gtttcactat attgcccagg 4800 
ctsstcttsa actccag££t ttaagcaatc taccttcctc agccteccaa aetgctagea 4860 
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ttataggtat gagccacccc ccggcctata atcctaccac tttaaaaaag cctgtaattt 4920 
tagcacttta aaaaattttt ctaaattttt tatagagatg ggggacagct gtggtctcac 4980 
tgtgttgccc aggctggtct tgaactccta ggatcaagcc atcctcctgg cctggcctcc 5040 
caaagtgttg ggattataag cataagcctt accttacctt ttttttttga gttgcagttt 5 100 
tgttcttgtt gctcaggctg gagtgcaatg gcaagatctt ggctcactgc aacctccacc 5160 
tcccgggttc aagcaattct cctgcctcag cctcccgagt agctgggatt acaggcatgc 5220 
gccaccacac ecagctaatt ttgtattttt agtagagatg gggtttctct atatacctta 5280 
attttaaagc actgcattca tgtaaattgt gattaacatg gattcaagag agggagtgag 5340 
gatgaatgag ccaggcagtc acctcggctg tcaccctcca cttctctcct ccttctgaca 5400 
gtcatcgtcc atccgtttct gcagctgttt gtttgactct cctgatcatt ttgcttgcca 5460 
cataacttgc ctcctgggaa agaatgccct gggcaggccc acatgagtag tgaaaaataa 5520 
tctgcagtga aaaataaaac taagtagtct ggtccacaga gcagtcttat tttttcactg 5580 
cagatgaagg agttgacatt caggcttcat tctcatttat aagtgtttta aagacacata 5640 
cagtggattg aacagtggcc ttcaaaaaga tgtatctaca tcctaatccc tgggacctgt 5700 
gaatgttaac caagttagga aaagggtctt cccgggtgtc attaagttag agatcttgag 5760 
atgaggagct catcgtggat tatccaggtg gaccctgcat ccaaggacaa atggtcctta 5820 
gaaaagaaaa gcagaggctg ggcacagtgg ctcaagcctg taatcccagc actttgagag 5880 
gccgaggtgg gtggatcacc taaggtcatg agttcgagag cagcctggcc aacatgatga 5940 
aatcccatct ctactaaaaa tacaaaaatt agcaaggcat ggtggcgggt gcctataatc 6000 
ccagctactc aggaagctga ggcaggagaa tggcttgcac ctgggaggcg gaggttgcag 6060 
tgagccaaga tcgcgccact gcactccagc ctgagggaga aaagtgaaac tctgtctcat 6120 
aaaagaaaag aaaagcagac agagatctga gacagaagag gagagtgaag gaaaaaaggc 6180 
catgtgaaga tgaggcagag gttsgagcca tgcagccaca agccaaggaa tacctggagc 6240 
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cccagaagtt gcaagaggta ggaagaagcc tcccctagag cctccagacg gagcacagcc 6300 
ctgccaacac ctccacctca gacttctggc ctccagcact gtgagataat caactgctgt 6360 
tgttttaagc caccagattt gtggtaattt gttatggcag ccacaggaaa ctaatacagt 6420 
acctaatctt cacaaaccca tcttacagaa aaggaaactg aagtcagaga ggtagtggct 6480 
tgtgcagtgt gttaggccat tcttgtatta ctataaagaa atacctgagg ccgggcatgg 6540 
tggctcacgc ctgtaatccc agcactttgg gaggccaagg tgagtggatc acttgaggtc 6600 
aggagttcaa gaccagcctg gacaacatgg tgaaacccca tttctactga aaatatgaaa 6660 
attagccagg catggtggcg tgcatctgta gtcccagcta ctcaggaggc tgaggcagga 6720 
gaatcacttg cgcccgggag gaggaggttg tagtgagcca agattgtgcc actgcactcc 6780 
agcctgggag acaagagaga aaccctgtct caaaataaat aaaaaacaaa taaacacctg . 6840 
agactgggta gtttataaag aaaggggtta actggctccc ggttctgcag gctgtacaag 6900 
catggtgccg gcatctgctt ggttgctggg aaggcttcag ggagttttac tcatcgtgga 6960 
aggcagagcc agagcaggtg catcacacag caaaagcagg agcgagagag agagagagca 7020 
gggaggtgtg cacactttta aatgagcaga tctcacgaga actcaccatt gcaaggacag 7080 
caccaagcca cgaggggtct gcccccatga cccaaacctc ccactaggcc ccacccccaa 7140 
cattgggaat tacagttcaa catgaggttt ggggggacaa atatccaaac tatatcattc 7200 
cacccctggc cccccagatc tcatgttctt ctcacattgc aaaatatagt catgccttcc 7260 
cagtagcccc ccaaagtctt aactcatccc agcattaact caaaaatccc attcccaagt 7320 
ccaacgtctc atctgaagat gagttccttt cacctacaag actgtaaaaa tgaaaacagt 7380 
tatttactgc tgagatacaa tgggggcata ggcattaggt aaacattcct gttccaaaag 7440 
ggagaaatcg gtcaaaagaa aggggctata ggccccaagc aagtccaaaa cccagcagag 7500 
caatcattca atcttaaagc tccaaaataa cctccttaaa ctccatgtcc catagccagg 7560 
scacacteet scaaseseca ggctcccaag gccttgggca gctctattcc tgcggctttg 7620 
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cagaattcag tccccatggc tgctcttaca gattggagat gagggcctgc ggcttttcca 7680 
ggtgcagggt gcaagctgct ggtgatctac cattctgggg tgtggatggt ggcggccccg 7740 
tcccgcagct ccactaggca ttgtcccagt ggggactcta tgtggggcct ccaaccccac 7800 
atttcccctc caatgggaag gctctgcccc tgcagcagcc ttcttcctgg gctcccaggc 7860 
tttctcatac atcctctgac atctaggtgg atggtgtcaa gcttccttca ctcttgcact 7920 
ctgcacacct acaggcttaa caccacatgg aagctgccaa ggtgtatggc tggaaccctc 7980 
tgaagcagca gcctgagctg tgactatggc cctttgagcc aaggctggag ctggaacagt 8040 
ctagatgcag gcagggagca gtgtcctgag gctgtgcaga gcagcagggc cctgtgcctg 8100 
gacaatgaaa ccattctttc ctcctcatcc tctgggcctg tgatgggagg gttgtggaag 8160 
atctctgaaa tgcctttgag gcctttttgc ctctgaggcc tatttcctat tgtctcagtt 8220 
attggcagtc ggctcctttt tagttatgca aatcctctag caagaggtta ctccactgcc 8280 
ggcttgaact cctctcctga aaaagctttt tctttctttg tcacatggcc aggctgcaaa 8340 
ttttccaaac ttttatgctc tgttttacct ttaaatataa cttctaactt taattcattt 8400 
atttgctcct gcatttgagc atagggaatt caaagaagct gggccacatc ttgaatgctt 8460 
tgctgcttca aaatttatgg ccacgcttgg tggctcacac ctgtaatccc agcactttgg 8520 
gaggcctagg tgggcagatc acgagatcag gagatcgaga ccatcctggt caacatggtg 8580 
aaacccatct ctactaaaaa tacaaaaaaa ttagcttggt gtggtggcgc agacctgtag 8640 
tcccagctac tggagaggct gaggcaggag aattacttga acctgggagg cagaggttgc 8700 
agtgagccca gatcatgcca ctgcactcca gcctggtgac agaataagat ttgatctcga 8760 
aaggaaggaa ggaaggagga agggaagaaa tgtcttcccc ccagatgtcc tgggtcatcc 8820 
ctcttatgtt caaacttcaa cagatcccta gggcatgaaa ataatacagc caaattattt 8880 
gctaaggcat aacgaaagtg acctttgctc cagttcccaa taagttcctc atttccatct 8940 
eagactcatc accctggcct tggcttgtcc atatcactgt cagcattttg gtcacaatca 9000 
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tttaaccagc taatcgggag gctgaggcaa gaggatcact tgaacccagg aggttgaggc 9060 
tgcagtgagc tgtgatcaca tcactgcagt ccagcttggg caacagagea agatcctgtc 9120 
tcaataaata aataaataaa tacataaata acttaagttt atttaaagct gcatctttgc 9180 
caccatggag aaaggccagg ccagctcctt ctctctttct gcacgtgttc ctcccacctc 9240 
agctgcctct gctcctcaag gaggaacaga gggagtagga aaggccatcc caggaggccc 9300 
agcaccccat gacctggctc tggggccttg tgggtttatg gattcccagt gctgagtcat 9360 
ccctcacagg ctcttgtggg caccttggac attggtcaga agcatgtggt ccccgggaac 9420 
acaccttttc ctgatcatct gggaagggca gcttgtgcca gcgaggccac ctgttcagcg 9480 
* ccacggcccg ccagacagct gcagccacag ccttgccttt gatcagagca aacaccagac 9540 
atgtgtgtca tgcccccaac ccatctccag gggacacatg tcctttcttg ccaggcctga 9600 
gatgaacaag agagggacaa gtccccaagc ctctctctcc ttcctgcctc acccactccg 9660 
ctgttagatt ctcaaggtgg atggtgggct aactagggca accgaccatc ctggtttacc 9720 
tagaactgag ggggcatttt caggaataaa actgcaaaag tctggagcaa acaggagcaa 9780 
gttggtcact ctggggctgg tggagtcagg tttccttctg caggccccct ccccgcaagc 9840 
atgggtggaa cccaggacag gaacacagag caggccccag gaccgggctt gtca.ctta.ca. 9900 
agtctttttt mmmt ttttgagatg gagtcttgct ctgtcatcag ggctggagta 9960 
cagtggtgcc atcttagctc actgcaacct ctgccttctg ggttcaagtg atccccctgc 10020 
ctcagcctcc tgagtagctg ggactacagg tggcaccacc acgcccagct aattttttgt 10080 
atttctagta gagatgagat ggccaggctg gtcttgaact cctgacctca agtgatctgc 10140 
ccgccttggc ctcccaaagt gctgggatta caggtgtgag ccactgtgcc tggccccact 10200 
cacaagtctt aaaccatgcc tcagcacatc aatgccattt acaaaaaggt agagggattt 10260 
tccaggcaaa aatagatgaa agacatagga tgattgatca tgtcctgctt aaacataggt 10320 
ctsatgctat taagaattea gggctgggag cggtggctca cgcctgtaat cccagcactt 10380 
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tgggaggccg aggcgggcgg atcacgaggt caggagatcg agaccatcct ggctaacacg 10440 
gtgaaacccc atctctacta aaaatacaaa aaatggccgc gcgcggtgac tcacgcctgt 10500 
aatcccagca ctttgggagg ccaaggcggg cggatcacga ggtcaggaga tcgagaccat 10560 
cctggctaac acagtgaagc cccgtctcta ctaaaaaata caaaaaaaat tagccaggca 10620 
tggtggcggg cgcctgtagt cccagcaact tgggaggctg aggcaggaga agaatggtgt 10680 
gaacctggga ggtggagctt ccagtgagcc gagatcacac cactgcactc cagcctgggc 10740 
gacagagtga aactccatct caaaaaaaaa ataaataaat aaataagaat tgttagtatt 10800 
ttgcaggtgt gacaaatgat tctgtttctg tggcagaatg ttctcaggag atctcttttg 10860 
aactctcatg gaaagcatca tgctgttggc aacatcacat ttatttttat ttatttatta 10920 
ttttttagag acagggtctt gctctgttgc ccaggctgga gtgcagtggc acaatcacag 10980 
ctcactgcag cctcaacctc ctgggctcaa gcaatcctcc tgcctcagcc tcccaaagta 11040 
gctgggacca caggcgtgag ccactgcact cagcccaatg taccttcaat atttacattt 1 1 100 
ctggcaaagg tagcaaaacc ttaacaaatt ttgaatctag ataataaaat tatgaggctg 1 1 160 
ggtgcagtgg ccctgacagg gatggctcac atctgtaatc tcaacatttt gggaggccaa 11220 
ggtaggcgga tcacctgagg ccaggagttt gagaccagcc tggccaacat ggtgtaaccc 11280 
tgtctctaac aaaaatacaa aaaaattagc cagacgtggt ggtgcacgtc tgtcatccca 11340 
gctactaggg aggctgaggc aggagaattg cttgaacccg agaggcagag gttgtgatga 1 1400 
gccgagatcg cgtcattgca ctccagcctg ggcaaaagca agagcgaaac tctctctcca 1 1460 
aaaaataaaa aaaaaataaa ttaatgaatt aattaaaata aaataaaata atggatagtc 1 1520 
actgtaaaga aaaaataaat gtatatatca gccaacaagt gatggaatag agcaccccat 11580 
ctccctggct ggacagatac atcccacaac acctggaagg cggctccatg tagaactttc 11640 
tggactgctt gaggtgctgt gctggagcac ggtgacagag gagctggacc atggacctcc 11700 
ccceeccccc accaassscg aggtccccct stggtgggtc teagggasgc atccgtatgg 11760 
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cctctgcggc ttgggcaggg aatttggggt ccaagtactt ggtgcaaagc ctggaaagag 1 1 820 
ggtttgggtg ctgagggcat atcccctggg ccacatgggg gcagaagtgg ggccccctga 1 188Q 
agcttggagt cctgggcagg ggcatctatt ttgctgtctg aggccttcag tacttgaagc 1 1940 
aaaatggagg cagaatgtcc caccttaatg cccctgattc ctccaaacca attccagaga 12000 
cagcaagggc cagaacaggg atggccctgc ccagggtcat gcancgagga agtggccagg 12060 
ctgggatctg aacccaggct aatcccctcc cttgtcctcc tccaggccct cacccctgca 12120 
tagagccctc cagctcactc atcctcggcc agctccatct cctcagcttg taaacccccc 12180 
cgggattttc ctttcttaaa aaacaaaggc ttggccaggc acggtggctc acgcctgtac 12240 
tttgggggtg gctcccagca ctttgggagg ccaaggtggg cggatcatga ggtcaagaga 12300 
ttgagaccat tctggccagc atggtgaaac cctgtattta ctaaaaaaaa aaaaattaac 12360 
tgggcatggt ggctagctac ttaggaggct gaggcaggag aatcgcttga acctgggaga 12420 
aagaggttgc agtgagccaa gatcgcgcca ctccacttta acctggcaac agaacaagat 12480 
tccgtttcna aaaacaaaca aacaaacaaa taaacaaaaa aaggcggagc gcgatggctc 12540 
gcgcctgcaa tcccagcact ttgggaggct gaggcgggcg gatcacttga ggttaggagt 12600 
ttgagaccag cttggccaac atggtgaaac cccatttcca ctaaaagtac aaaaatcagc 12660 
caggtgtggt ggtgggtgcc tgtaatccca gctactcagg aggctgaggc aggagaatcg 12720 
cttgaaccca tgacctggag gctacagtga gctgagattg cgccactgta ctccagcttg 12780 
ggcaacaaga tttgtttctc taaaaaaaaa aaaaaaaaga ctggcccttc cccttcagct 12840 
cttcctcagg gtccctgagc actctacacc cccgtctaca ctgagcactc caccctgctg 12900 
tctacactga gcactccacc ctgccatcta cactgaggac tccaccccac tgtctacact 12960 
ggctgcctcc cgccctcacc tcctgctaag gccattcccc gctgcatctg tcttctagat 13020 
tctgcagcct tcagcacgct gggcccctcc tttgtcccct tgagccacct ccagcctccc 13080 
cctgagctgc tactcctctc ccagcagcct ccacccaagc ccctccagtc cccaagctgt 13140 
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cccttgcatc cagcactgcc cttccacgtg ccccttccct ccagcttcac agcagggtgg 13200 
ggcctccagg ccctgcccac tgtgcccatc cacaagttgt ggtgggagct ccgaggggag 13260 
gcaggggtgt gcatggactt gggacgtcca agtctgggac caggggcagc tggttggtgg 13320 
agtgtggagg gggataggga ctttcaggta gagaggctgt aggggcaaga tcgggacggc 13380 
ggatgtccct aaggagggct ctgacctggg aaatattgtg cagcttcctc tttgccattc 13440 
ctggagctca gacactggcc ggctctcacc ccgcccttcc tgcaggacac agctccatcc 13500 
cagtgagttc ctagtgtaga catctccagc agcacggatg ggaaaggaag tcatcaaagg 13560 
tgcccaggac cggaggcttt ttctggaggt ggcagaggag ggtgtgggtc tcagggctct 13620 
ggctgagggc aagcgtggga ggtcttaggt ctgcaccagc cccgtgaagg cccctcctgc 13680 
tccctggtgg agtcctagag ggaacagcag cccctaggct ctagcaggag tgggtagggg 13740 
cttttctggc ttcctactgt gccagcagga tagctgggcc tggcactgag cccaaagatc 13800 
acatgccggg gcattggcgc agtgaggaac agacccttgc caaagctggc aaagaagacc 13860 
ccatggggtg cagctggtga agctgagagc tcaatgtttg ggggagcctg gcaaaagggg 13920 
tcctcccctc cctctgcagg ccaggatcgc aggttttccc tacatgttgg taattctcaa 13980 
acaatcccat ggccactgga gcaaagatca cagtgggcgg cggcctcggg agcagtggac 14040 . 
agggcacgca gtgcctttga tgccagagcc ctcgccccaa agtcaacaaa ctctgcagcg 14100 
gactttgcac ccggactttg ttttcaccat acaaggaaag ggacagatca caggccctct 14160 
cgctgccctc gctgagccgg aagctgcagc gtgagctctc tcaagcccca tttctaggtt 14220 
ccccaggcgc acccctgagc ccctactcgc ctattaagtt ctcctaatag cccttcaagg 14280 
tcttaatgta tgtccattag acagagggga aaactgaggc gagggcaagt gacttgaccg 14340 
aggttcctcg gcgagcaggg cgtggagctg agaacctcgt tattactgct ccccacacaa 14400 
ccctctggcc gttcttggaa gaaggctgag ccccgggggg gccagagtga cccaaacacc 14460 
atgggccgcc tgcggtaaca cgtgcggcca cgaaggggca gcagtttccc gcccggccgg 14520 
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gctctctccg gcgctcagta tccgtcccag gccaagaaga agaaactcgg ggaggagggc 14580 
ggagggggct gcgtgggagg gcgtggaaga tggacgtggc caggggagtg gcagctgcac 14640 
acagtggatg ctgttaagat gaagggaaag aacgtgggct ccgagatcac tggacacggt 14700 
tccacctttc ttcccgctca ctgcatggcc ctgggcgggt tgttgaaccc ttggaaacct 14760 
gtttttcctt ttttcctttt tttttgagac agggtcttgc tctgtggccc agactggagt 14820 
gccgtggcac gatcttggct cactgctgcc tcccaggttc aagtgatcct cccagctcag 14880 
cctcctgcgt agctgggacc ccaggtatgt gtcaccacag ccggctaatt tttgtatttt 14940 
tttgtagaga cgggatttcg ccgtattgcc caggctggtc tcaaactcct gagttcaccg 15000 
gatcttcctg cctcagcctc ccaaagtgct gggattactg gcatgagcca ccgcacccag 15060 
cagagacctc agttttctaa cctgtgccag caggaataat gatagctgcc tagcttggct 15120 
gtgctgggaa ttaagtaaga tgaccgggta gcaaatatga agtattactg gacacagagg 15 180 
gccccaggct gggttagcag cggtggtcag ggctgctgct tcctggcctg agctcgaagg 15240 
agggccctca ttaccacctg ggtgagtcct cgtccaagcc tggcactgct gcgtgggaat 15300 
aacttctgcc acccaagttg gcagattgtg tgcaaagtta agtcctgact ctgtggggtg 15360 • 
gacttcgagg cctcttcatc ggacctgctt ccggtgactg cattcgcacc tcctcctgtt 15420 
cctggtttaa cacagcccag ctttcctcct gctgagccct ccctgggcct gctgtcaccc 15480 
tcgtgccgct gtgcctcgca gtgccactcc ctgtaccetg aatactttgc cctgcctctc 15540 
cacccagctg agagtcaggg cccctgtgag gctctgccca gcccgtcctc cgggtttctg 15600 
cctctgctga gcacttccct gcatgattgc ttctgagagt ccccccagcc tgtgagcttc 15660 
tcaggactgg gacagcttct caggaccgag gcttcctggt ctgcttgcaa ttttacaggc 15720 
gggcacattt tcccttggcc aacatcagag actggacatc tgcagatctg tgctagccac 15780 
tgagcaccca ggcaccccag caggtagctc tgtaaccaac ccattctgta aagctgaggc 15840 
tcasasasst eaascgccts gcctggsgcc acagcctgcg tcasctgcag agccassagc 15900 
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tgagatatgc acctgcggct ctgctcacag ggtcctgcac agactgctgc tggagccacc 15960 
tatgtagagt caagagagtt catgttaact ccctctcaca tccctcagcc agggtggggg 16020 
ctgacgatag acactcaggg atggcctacc ctccccaaca acccccgtca ggtttgccgg 16080 
atctccttgg aagaaaagtt ctgggcagaa ttccaccgtt ggcctggcct acactetcct 16140 
tagtggctta ggaccctcag cggtggataa gttgtgggca gaagagatgc aatcaggatt 16200 
ctcacccact caccccttgc cagccccaat aagctcaata agctgggctc ggtctgagga 16260 
agtgtccagg aaatgtgcaa atggcctggg acagccctgt gttcctttca gtaaggttgc 16320 
tgaaggtgag gctgaaagtt ggagaaacag aagccagtgc ttatggtttt aattaagata 16380 
atggaatgta tgtatgtatg tatgtatgta tgtatgtatt tatgtattta tctttagaga 16440 
tagagtctca ctctgttgcc caggctggaa tgcggtgaca caatcatagc tccttgcagc 16500 
ctcgacttcc tatgcccaaa tgatcctcct acctcagcct cctgagtagc tgggactaca 16560 
gacacacgccaactatgcctagctaatttttatttctattttttgtggagactgggttct 16620 
cactttgttg cccaggctgg tcttgaaccc ctagcttcaa gcaatcctcc tgcctcagcc 16680 
tcccaaagtg gagggattac aggtgtgagc caccacacct ggcctggaat ttatttgtat 16740 
tctgcttataaaattaatacattcttattgcagaaaagtttgaaaataaaagaaaggaca 16800 
aagaacaaaa agcgtatata atttcacagc tcagatctca ctgctattaa catttttatt 16860 
tactttcagg cttttttctt tctaggtaca tatgcagaga ttattttatt ttatttattt 16920 
tattttatat tttattttat attttttatt tcattatttt attttatttt attttattat 16980 
ttttagagac agggcctcac tctgtcaccc aggctggagt acaatggagt gatcatagct 17040 
cactgcagcc tcaaacacct gggctcaagc aatcccccca ctcagccttc tgagtagttg 17100 
ggactaaagt gtgagtctgg ctaatttttt ttactttttg tattgacaga ggtctcacta 17160 
tgttgcccag gctgatctca aactcctggg ttcaagcgat cctcccacct tggactccca 17220 
aastsct S£e attacaggca tgagccacca tecctgscct aaaateccac tttttetcat 17280 
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ttactaaaat cccatggaca ctttgacatg tctgtattct atgctattga tctgactgtt 17340 
ggcatctaca tcattatggc catctatcat ctatcataat ccattttaac attaaaattg 17400 
tgctgctgct tagatttttc tggcctgtct cctatttgta ttcttccaga taaattttag 17460 
aatcatttta tcaaattccc cttgcagaaa aagccctatt ggattttggt tgaaaaatac 17520 
tgaattttta cattaactta ggaaagggct gggcacggtg gctcacgcct gtaatcccta 17580 
cacttttcga ggccaaggca ggtggatcac ttgaggttgg gagtttgaga ccagcctggc 17640 
caacatggtg aaactcggtc tttactaaaa atacaaaaat tgccaggcgc attggctcac 17700 
ctgtaatccc agcactttgg gaggccgagg tgggtggatc acgaggtcag gagatagaga 17760 
ccatcctggc taacacggtg caaccccgtc tctectaaaa atacaaaaaa ttagccaggc 17820 
gtggtggtgg gcgcctgtgg tctcagctac ttaggaggct gaggcaggag aatggtgtga 17880 
acccaggagg cggagcttgc agtgagccaa gatcgcgcca ctgcactcca gcctgggcga 17940 
cagagtgaga ctccatctca aaaaaaaata ataataataa tacaaaaatt agccgggggt 18000 
cgtggcgtgc acctataatc ccagttactt gggaggctga ggcaggagaa tcgcttgaat 18060 
ccaggaggtg gaggttgcaa tgagcagaga tcgtgccact gtactccagc ctgggtgaca 18120 
gagtgacact ctgtgaaaaa aaaaaaaaaa ttctgaagga ttgagactct tagactctta 18180 
ggtcttccta tccaagagca caatatagct tttcatgtat tcaagccttt ttcaatgcat 18240 
caacagaatt ttacagtttt tttcatgata tcctgctatt tcttataaaa tgtattccta 18300 
gatattctgc atgttttccg gttgtttgtt aataaatatt tttcatttgt cattatttcc 18360 
taattggctg ttatttgtat atatgacatc tgttgaattt tttgattact ttgaaaatgg 18420 
ccattctttt gtgttttttt ttaactttct attttgagat aattttgact tacagaagat 18480 
ttgcaaaaat agtacagaga gttcctgttt cccccttatg ttaacccagt ttctccttat 18540 
gttaacatct tacataacta cagaacaatt gtcaaatcta agaatcaacc tgggcacaat 18600 
ectattaact aaactscaga agctsttcas atctcaccag ttcttctact gctccccttt 18660 
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tctcttccag tgttcaatcc ggaatcctac attatattta gttgtcattt ctctttggtg 1 8720 
tcttccaatc tgtgacagtt cctcagtctt tctttgtctt tcatgacttt cattttttta 1 8780 
tacttttgaa aaatactggc cggttgtttt gtagaacgcc ctcagtttgg gtttgcctga 18840 
agttttttgt gattagatcg aggtcatgca ttattggaga gggtgccacc gcctcgatgt 1 8900 
gcaagctcaa tgcatcatat cagagggttt gtaatgtcag tttataccgc cggagaccct 18960 
aacctggagc atttcgtgaa ggtgctgtct gccaggattc tccactagaa agttactatt 19020 
tttccctttt taattactga atgtctgagg ggaaatactt tgagactatg caaatatcct 19080 
gtttctgctt taacttcggc tcactaagtt tagcattcat ctatggatct cgcttatagc 19140 
aagtattact gtggagttct aatggtaatt ttctgtttct ctcattcctt caacctttat 19200 
taatatgctt cttcctcact tattcatttt gtttcagttg tttataccaa catggatttg 19260 

tggatattgg ttttattctt tgggttgcaa ttgaatccta tcattatttt gttagtcagt 19320 
tgttccatcc gaccttggtc attaggagcc cttgaaattt ggctcccatg cctttttttt 19380 
tttttttgag accgagtctc actctgtcac ccaggtttga gtgcagtggc atgatcttgg 19440 
cttcctgcaa cctccgcctc ccaggttcaa gcaattctcc tgcctcagcc tcctgagtag 19500 
ctggtattat aggcgctcca ccaccttgcc cggctaattt tttgtatttt tagtagagat 19560 
ggggttttat tatgttggcc aggctggtct caaactcctg acctcaggtg atctgcccgc 19620 
ctcggcctcc caaagtgctg ggactacagg cgtgagccac cacacctggc ctcctatgcc 19680 
attttaacat gcccgtcttt tctttttctt tcctactttc tgtgactgta agaagctcca 19740 
ggatacattt ttgctgccct agacttagcc tcaatcagtt ctcagaaaag ctctggttct 19800 
ttttatggga tacttagaaa actagctctg tatggcctgg cgcggtggct cacgcctgta 19860 
atcccagtac tttgggaggc cgaggtgggc agatcacaga tcacgaagtc aggagatcaa 19920 
gaccatcctg gctaacatgg tgaaactctg tctctactaa acatacaaaa aattagtcca 19980 
e S cgc sete£r c ee£ cecctg tagtcccaec tactca Seae gctaaescae eaeaacegca 20040 
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tgaacccggg aggcggagct tgcagtgagc cgagatcggc agccactgca ctccagcctg 20 100 
ggccacagag cgagactccg tctcaaaaaa aaaaaaagga aaaagaaaaa agaaaactag 20160 
ctctgtatgc tagttttttt tttaagacag ggtctctctt gcccxagctg gagtgtagca 20220 
gcacgatcac agctcactgt agcctcaacc ttctgggctc aagcaatcct cctgcctcag 20280 
tctcctaagt agctgggtct acaggcatgc accaccgtac gtggcaattt ttaaaaactg 20340 
tttgtagaga tggagtctcc ctatgttgcc tggtctggaa ctcctggcct caagtgatcc 20400 
tcctgcctcg gcctcccaaa gtgctgagat tacaggcatg agccactgta cctggcctgg 20460 
ccaaggtctg tcttttttta aaagaagttg ttgtatagtt gttttttttt ttattttttt 20520 
ttctgagacg gagtctcgct ctgtcgccca ggctggagtg cagtggtgcg atctcggctc 20580 
actgcaagct ccgcctccca ggttcacgcc attctcctgc ctcagcctcc cgagtagctg 20640 
ggcctacagg cgcccgctac cacgcccggc taattttttg catttttagt agagacgggg 20700 
tttcaccgtg ttagccagga tggtctcgat ctcctgacct cgtgatccgc ccgcctcggc 20760 
ctcccaaagt gctgggatta caggcgtgag ccaccgcgcc cggcctgttg tatagttttt 20820 
atctcgagtt ttctagcgat ttaatcatat tggttacaaa aaaggatgat tttactacct 20880 
cctttccaat gtttctacat attttttcat tttatctaac tgcattttaa aataaacttt 20940 
taattttaga atggtttcat atttacagaa aatgtgcaaa gatagtacag agagttcctg 21000 
tgtactccac acccggtttc cttattatta tcttaacgtg atacacaatt aataaaccag 21060 
taacattatt attcactgaa gtccacactt tctttttttt tttttctgag acggagtcta 2 1 120 
cttctgtcac ccaggctgga gtgcagtggc gcaatctcgg ctcactgcaa cctccacctc 21180 
ctgggttcag gcaattctgt ggctcagcat cccaagtagc tgggaataca ggtgcccgcc 21240 
accacgcccg gctaattttt tgtattttta gtagagatgg ggtttcacca tgttagccag 21300 
gatggtcttg aactcctgac ctcgtgatct gcctgcctca gcctcccaaa gtgctgggat 21360 
tacaggcgtg agccaccgcg cccggegtcc atactttctt tagajatcct tcctttttac 21420 



Rw.«5nnr.in- <wn oi92S9iA2 t > 



WO 01/92891 

66 

ctaacgtcct tcttctggtt caggatccca tccagaaagc aacattaccc ctcgccatca 21480 
cgtcttcaca ggctcccctt gacgggaaga gttcctcaga ctttccttgt ttttgttgac 21540 
cttgacagtt ttgaggagga ctggtatctt agtctgtttt gtgctgctat cacagactag 21600 
ctgagaccga tacatgatac atgaaaaaaa atgtattctt acagttgtgg aggctgggaa 21660 
gttcaagacg aagttgctgg ttggtttggt ctctggtttc aagatggcgc cttgctgctg 21720 
catcctctgg agaagaagaa tgcggtgtcc tctcactgca gaagatggaa gcgctaaaag 21780 
gaatgaactc cctttgccaa gccattttat aatgggcatt aatccacaaa ggatgaaacc 21840 
ctgagaaaca tcaagcttta aagcactggt tctcaacctt tttggtctca ggagcccttt 21900 
atactcttaa aacgttttga ggatcccaaa aaaaggcttc tacaggttcc atcttttaat 21960 
atttaccata tcaaaaatta aactgaaaaa attttaaatt atttattcat ttaaaataac 22020 
aaggataaac ccattacatg ctaacataaa tcatgtattt tatgaaaaat agctatattt 22080 
atcaaaacaa aaattagtga gaagagtggc atgtataatt ttttttgttt attttttgtt 22140 
tttagatgga atcttattct gtcgcccagg ctggagtgca gtggtgtgat ctcggctcac 22200 
tgcaagctct gcctcccagg ttcacaccat tctcctgcct cagcctcctg agtagctggg 22260 
actgcaggtg cctgccacca cgcccggcta attttttgta tttttagtag agatggagtt 22320 
tcaccgtgtt agccaggatg gtcttgatct cctgaccttg tgatccaccc gcctcagcct 22380 
cccaaagtgc tgggattaca ggcttgagcc actgcgtctg gcctaaattt ttgtgaatgt 22440 
ctttaatgcc tgccttctca tatttgtttc tgcattcaag ttattgcaaa atgttgtgtt 22500 
ggttgaagtt tgtaaagaaa atgtggcctc atacagttgt gtagttggaa aggcaagagt 22560 
attttgattc tctcttcaaa caactatgga caacctgctg ttacaaaacc agaatgcaaa 22620 
aagttgtagt aaatacaggt taggtgtagt gtggaatctg aaagcatgtg aatgaacttt 22680 
ctgagttttg taacattaaa gtccagttgc gttaagctac tgtgatagca tatagcattg 22740 
tcctaatact ggaattagta tcagaagtgg ggtgctactg ttaataaata aaaagaaata 22800 
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aataaatcat gtgatactgg ctcagaagtc aggcagtagg ctgtgtggaa cctgacatca 22860 
cgccatgtaa tacattggca accatttgat ccagctgtct gtcatgatga cttggaaagt 22920 
caaccacata cttacagagc ctgtagacat aggggaaaat agtataaaac agaatactaa 22980 
cagtggacct tggttcttgc cagttgcatt tagccaaata ttaaacaaaa gagatattct 23040 
tgggcagcaa ctggaccatc ttcaagtaaa agtgaaaggt aataaacaga gtccagacat 23 100 
ttgtgcccat gcgggttaag aaaaatccag ttgcttctag acaccgtata tgaaaacaac 23 160 
gctgaaaaca agcctttgag tggtaaaggc cgattaacac tcagcgcggt aacaaagacc 23220 
aggtgggcta acccgaaatg aaatgagaag cctgtggtga tgaggaggca gagaagtaaa 23280 
atcaagtttg agcatttcgt ttaggagagt ttgggctctg attacttgca catgcaaacg 23340 
aactggaaac aaacagatca gatgtctacc acttcttcga gggaattgca ttgccaaaga 23400 
agtcatgaaa gcagactcta tactgattag gcattaaaac aaaaacaatc tttaggcccc 23460 
taaacttgca tgggcaggaa gtgggctgtc aaagctgttc atcctctaag gtggacctag 23520 
ttcctagtcc ccagtataca cttcagatgt ggccctggag gacactggac atggaggacc 23580 
tcccagagga tgaggctagg gcttcatttc tccaatgacc tcagctgcct ctatttcccc 23640 
ttcttcctct ggaagtccta tcatcgttat tattattatt atcatcattt ttattttgag 23700 
ataaggtctc gctctgttgc ccaggctgga gtgcagtgac atgatcatgg ctcactgcag 23760 
ccctcccagg ctcaagtgat cctcctgcct cagcctcctg agtagctggg agtacaggca 23820 
catgccacca tgcttggcta tttttttttt cagtagagat agggctctca ctatgttgcc 23880 
agggctgatc tcaacctcct gggttcaaga gatcctccta cctcagctcc tgagtagctg 23940 
ggattcgggt gcacaccacc atgccaacta atttttaatt tttttttgta tggacaggat 24000 
gtacagtgtt agaaatggat tgcttgcaga ggcaggagga tcacttgagc ccaggagttt 24060 
gatcacactg tgaaccatga tcgcacccct gcactccaat ctgggcaaca gagtgagacc 24 120 
ttetctcaaa aaaaaaaaaa aagagaeaga gagagagact caaagatagg caaaaaagtg 24180 
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ggaaagcttt atagtggaca aaaaggaacg ctctaagtct gccctattgg catggtgctg 24240 
aa ggtgggct aactagagat agggggtact atgtggttga ctatgggtgc atctttggct 24300 
ttccctgggt gatcctaagt tggaagcagg gacaaaaatt agggaagctg ttagttattc 24360 
atcacgttct ggcagtagtg gactggttgt gatagaagtt attgttttgg ccaggtgcgg 24420 
tggctcatgc ctgtaatcct agccctttca gagttcaacg tgggtggatc aggaaggagg 24480 
gaggatttgg gaggtcagga gttagcctgg ctaacctggc gaaatcccat ctctactaaa 24540 
aatacaaaaa ttagctgggc gtggtggtgc atgcctataa tcccagctac tcgggacgct 24600 
gaggcaggag aatcagttga acctggggag gcggaggttg cagtgagcca agatcgtgcc 24660 
caatttcatc tcaaaaaaaa aaaaaaagtt atcgtttagc ttcctcgatt gttactggac 24720 
gtagtaatct ggcttcctgc aagtctaact ttcagcagac tggctacatg ggctgtgtac 24780 
tgtagataag gcagtaagta aagcaaaaat tgatagagca tcaaggataa atagaaaatc 24840 
cgtaatcaag cagaagattt gaacacttca ctttcagtaa ctgataaaac aagtagacaa 24900 
aaaaaatcag taaggatgta gaagatttga acaacgtaat taacaaactt gacttgattt 24960 
acacgtctag aaccctgcag aacacacact ttttcaagca tactcagaac atttatataa 25020 
agtgaccata tggtggacca taaagcagtt tcaacaaatc tcacaggagt aaaataacag 25080 
accgtgtttt ctgaccgtaa gtacagttaa cctagaaatt gaaaacaaaa agctagaaaa 25140 
accccatgta tctggaaatt ttaatataca ctttgaaata acaaatggat cagagattaa 25200 
ttcaaatagg aatttagaaa taccttgaac tgaaaaataa tgagaatact ataccccaaa 25260 
actgtggggt gcagctgaac agtatataga cgaaaagtat actcatatgt gcatacctta 25320 
aggagcgggg aggattgaaa gttaatggga ggcaaaagca ggtggatcac ttgaggttag 25380 
gagttcaaga tcagcctggc taacagggtg aaaccccatc tctactaaaa atacaaaaaa 25440 
ttatccaggc gtagtgaggc tgaggcaaga gaatcgttgg aacccaggag gcagaggttg 25500 
cagtgagccg c£att£C£cc actgcacccc agcctgggag acagagcgag actccatctc 25560 
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aagaaagaaa aaaaaaaaag aaaaggccag gcgcggtggc tcatgcctgt aatcccagca 25620 
ttttgggagg ccgaggtggg cggatcacga ggtcaggaga tcgagactat cctggctagc 25680 
acggtgaaac cccgcctcta ctaaaaatac aaaaaaatta gccaggcgtg gtggcgggtg 25740 
cctgtagtcc cagctactca ggaggctgag gcaggagaat gtcatgaacc caggaggcag 25800 
agcttgcagt gagccgagat cgcgccactg tactccagcc tgggcaacag agagagactc 25860 
tgtctcaaaa aaaaaaaaaa gttaatggga taaacatcca tctcaagaag ttagaaagga 25920 
atgacaaata aaccaaaaaa aaaaaaatca aaagaagaaa atcataaggt caagactata 25980 
aagagagtgg ctgggtgcag tggctcaggc ctgtaatctc agcattttgg gaagcagagg 26040 
tgggcagatc acttgagccc aggagttcaa gaccagcctg agtaacatag agagacctca 26100 
tctttgctga aaataaaaat aaaaaattag ccaggcatgg tggtactgag gtgggaggat 26160 
cacttgagcc taggaggttg aggctgcagt aagccatgat tgtgccactg cacttcagcc 26220 
tgggtgacag agtgggaccc tgtctctaaa aaactaaaat aaggctgggc gcggtggctc 26280 
aaatctgtaa tcccaccact ttgggaggcc aaggctgagg tcagcagttt gagaacagct 26340 
tggccaacaa gatgaaacct catctctact aaaaatacaa aaaattagtt gggtgtggtg 26400 
gcatgtgcct gtaatcccag ctacttagga ggmmnctnt ngattatatl ttctccttcc 26460 
tacgtcgtta ttggactgaa ttcagaatga tgactctcat tggagctctt cctgtctcct 26520 
aactacagtg gcttccgacc ccactctggt tttcacttca cccctctgct gctcatacga 26580 
gtagatactt ccttccttct ttctcacttg ttgctcttcc tcaacccccc ccgttggtgt 26640 
cccctcctct ttatcttttt ctcgcgacac ctgcgttctc ttgccctctt atcatccctt 26700 
tctcgaggcg gtcctttcct ttatccagct taaatacctt ctcctctgtt tatttggggg 26760 
ttgggttttt atctctcacc ctccctctaa tttctttcct ctttccgcac ccatcaagcc 26820 
tctcgtggtt tctcttcctc tactctcggg tcccccccct ctccccttct ttttttcttc 26880 
acccccccaa ecectttgcc ttttttttct ttgcccttta ttcccccc 26928 
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<210> 7 
<211> 29430 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> unsure 

< 222> (4336),(4345),(4349),(4392),(4447),(4490) 

< 223 > Identity of nucleotide sequences at the above locations are unknown. 
<400> 7 

aggggaaggg ccggctccgt agctcacacc tataatccca gcactttccg aggagagagg 60 
atcatctcag gccaggagtt caagaccagc ctgggcaaca cagcaagacc gcatctctac 120 
aaaaacttct tttaaagctt aaaaaaaaaa aaaaaagcaa agaggacagt tcaggagaaa 180 
agcctgtaga ggcagcacac taaggaggag acgcagccca ggcaccagga ggggctggcc 240 
atgggcactc actcctccag caggcgagtg cccagcacca gctggcccac ccagacaccc 300 
aggacacggc ctgaatggct ccgtattcac gtgggtggta ataaacaagc aatacacata 360 
gccaataagg acaccttagt aatgttacat cataaacgct gcagatcagg gaaatggtgc 420 
agggtgaagt gggttggggg gctgcatgct acatgagaag tgggtcgggg ggctgcatgc 480 
tacctgagac agagcaggcc ttgctgggaa agaaggagcc ggcaggcctg ggcaaaggtc 540 
ctggggtggg agcacactgg agcagagtgt gggggtagca tggcgggtgc tggtcctctg 600 
esceccttcc caccacstca tgtgcccatg tecccaaggt ctctcgtttc acagccccct 660 
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gaagctcagg ggtcacagct acacagcccc cagatacctt ggcctgcccc aggtcattcc 720 
atccagtgat ggacctgctg acctctagcc tgacctctgg gcagcgtaat ttgagaagga 780 
ggagaaggga gggcaacaga cctggggcga tgagggatgc acagggtggc agacacctga 840 
ggctgcacct tggagcctca gttctgggtg tgggtggggg atggacaggc tgagggctga 900 
agcagctggg cccggccacc atcacacccc aggacccacc agatcaccat gaaaaaccga 960 
atgtcaactg gcagcccaga gtgcagaaca aacctttcag aaacacggtg gtgactgccg 1020 
catcatgaac ataaaataat tacgccctct ccccagggat cacccctgca ggagtttgtc 1080 
ccaagaaaca ccagaaagaa ggaaaacgtc tgagtcacaa tatttgctga ggccttattt 1 140 
gtaatagcaa aaaaaaaaaa aaaaaaagaa caatctccag cggcaggggt aactagacta 1200 
ttgtctccgt ggaaaggtag caccaattaa ctagtaacaa aatgactgcg gtaacaacaa 1260 
aacgttcgac atgtcaacac caaaaaccac acacccagca taaccgtgaa ccatgatttc 1320 
tactagaatg aatggcagtt atgagaaagc accagcggag acaaagattg aaaaagtaaa 1380 
ggtggcctca ttagggagac aagtctctgg gtaatatatt gtaatactgg taaatatata 1440 
gtttttaata tattttttaa ttccaaattc catatatgtt cctatgaagc tatttctgca 1500 
aatatttttt tcaggaccgt acatcacaaa ggcaaaaggg ccaggtcagc tctccagctg 1560 
agagtgacca cttcagagca gacggcagac tccagggtta gcaagcctgg ctgagacctg 1620 
gcccatgaca atcactcaac ccctctgacc tcaacatcct gtctgtgaaa tggggataat 1680 
tactgcacct ccacatcaca gagtgcgagg cttaaacagg atgcttcata gaaaagcgct 1740 
caagaggtaa cagccgggag ggggtagtgg ttttcattaa ttaaatgttg ccttcatcca 1800 
gccctgggcc agctccaaca caaagcacac accatccact cagactcagt tgcctggatt 1860 
caaagcccgg cctggcctcc agctgtgaga ttccgggcag gatttcccat ctcccagagc 1920 
ctcagtttcc tcattcatga aacaggaagt gatcattcct tttattttta tttttatttt 1980 
tatttteaea ceeastttca ctctasttec ccasecteea etateatesc ecaatctcag 2040 
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ctcactgcaa cctcggcctc ccagtttcaa gcgattctcc cacctcagtc tcctgagtag 2 1 00 
ctgggattac aggcacacgc caccacgccc agctaatttt gtatttttag tagagacggg 2160 
gttttgccat gttggtcagg ctggtctcga actcctgacc tcaggtgatc cgcccgcctt 2220 
ggcatcccaa agtgctggga ttacaggtgt gagccaccaa gcccagttga caactgcttt 2280 
taaagacacc tctggctgct gtggaaaaca gcctggtagt gcctcaaaaa gttacacata 2340 
gaatgatcct atgaccagta attccactcc tacatatata cccaaaagaa ctgaacccct 2400 
ctactcatgt atgtacacat acaggtacac gcatgttaac agcagtgttc acaaagccaa 2460 
aacatggaaa cagctcaaat gtccataacc gatgaacgga taaatgaaac gtagtctatt 2520 
caccacctga cggaggtgag aggggccata aaaaggaatg atgcataaaa acgaatatta 2580 
tggccaggta tggtggctca cgcctgtaat cccaggactt tgggaggctg aggcgggcgg 2640 
atcacgaggt aaggagttcg agaccagcct ggccaacacg gtgaaacccc atctctacta 2700 
aaaatacaca aattagctgg gcatggtgga gggcgcctgt aataccagct actccggagg 2760 
ctgaggcaag agaatccctt gaacctggga aacagaggtt gcagtgagct gagattgcac 2820 
cactgcactc cagcctgggc gacagaccaa aactccgttt cggaaaaaaa agaaaaaatt 2880 
agccaggtgt ggtggcgggt gggtccctgt aatcccagct ctacttggga tactgaggca 2940 
ggagaaccac ttgaacccgg gaggtggagg tagcggtgag ctgagattgt gccactgcgc 3000 
tccagcctgt gtgacagaag gagactctgt ctctaaaaaa caaaaacaaa aaaggcccga 3060 
cgcggtgtct tacacctgta atgccaacac tttgggaagc caaggcaggc agatcatctg 3 120 
aggtcaggag tttgagagca gcctgggcaa cacggtgaaa ccccatctct actaaaaata 3 1 80 
cagaaattag ccaggtgtgg tggcacatgc ctgtaatccc agctactcgg gaggctgagg 3240 
caggagaatc gcttgaaccc aggaagcgga ggttgcagtg agccgacatt gcaccattat 3300 
actccagcct gggtgacaga gtgagattct gtctcaaaaa aaaaaaaaaa aaaaaaaaaa 3360 
ctaaacaaaa ecaaaaaaac caatgagtaa tgttgtcaag tgaacttcat cccaatggga 3420 
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atgcagataa tttgtttaaa aggcaccatg cacactgggc aggctggctt cccctgggaa 3480 
cgtcttcttt tgcctggatt cccagttggt ttaatcgggc gtagaacact ttcttcaatc 3540 
cgggattcag gcacccctgc tcagcacaaa ctcagtacac cccgcactct gctgtgggtt 3600 
cttggcacta ttaggagaat gtgagggggt gattcagatc tatctctagt gggtgcatgt 3660 
ctgccactcc caggaacgcc cacttctggc aagtcagtgt cagagaaagg ccagctcgtg 3720 
gcccctcctg ccttgagtcc caggacccgt gatcagtcct acccggagca gaatcaggag 3780 
tttgaaaacc caagtgccaa caatctcatt ttaacccatg taagcatatc caatatttat 3840 
atatagaatt cataacagat gtctgggctt ccattccaat agcctatatt ttaeactgtt 3900 
tatttacatg gttacaccaa acaagactca attcaaggta acccaatcct ttgctactat 3960 
accaaaataa gcaacatttt cagtccatgc cttatatata ttcaccaagc attacactag 4020 
gcctccaact gctcatcgga gcaagctgca gcctggacac aagctagaga ttaatcagtc 4080 
aggaatgatc ctgcgtccag tgccagcatg atggaagaga cagagaaaca gaagacatca 4140 
gggctccaga gtcaaggagc ctgcaggtta gttgggcagg atatacacac atacacacac 4200 
acacgcacac acaaaaccac ccaagaagaa aaggtgggat gaatgcatgg acaggtaatg 4260 
cctggagcct ggggatggat aagctgactg caggtggccc aggcaggctt cctggaggaa 4320 
gaagacctgg ctgtangtgg ggtangcang ctttctaaat ggggaaaatc tggctgtggg 4380 
tggagttggc angtttccga aaagaagaaa agctgactat gggtacacct ggctgttggt 4440 
ggaacangca ggcttcttgg aagaagaaaa tctggctgtg ggtggatcan gcaagcttct 4500 
tggaagaagt aaacctgact atgggtggac caggcaggct tcctagagga agaagaccgg 4560 
ctgtgggtga accaggcagg cttcctagac agaggaagat ctggctgcgg ttagagtggg 4620 
caggcttcta agaagaggaa gggctgactg tgggtagacc tggctgtggg tagactgggc 4680 
aggcttcctg gaggaggaag agctggagca ttgaaaaaca aacatgactt ggtgaatgtt 4740 
gagcatgccc aggcctgatc cccagaggca attacgcact caagttactt aattctactc 4800 
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acaatgcctc acaaacaact tctctgacac ctaacacagc tctgggcacc ttctagcttc 4860 
agctcctcaa agcagttatt cacgctacta ccctgcacac ctcctcacac cccaacccca 4920 
gggacaggag ttctgccaga tgccaaagct cctgatgcca aagcctgggt ctgcttccgg 4980 

gctcctcttg gtctaactgt ccaccccgca tcggcatgat gtgcaaaaac aaggctttgc 5040 

i 

aatctgccct gatgcctggc ggagcgagtc cctcccgatt cgtctccttc agaaacacct 5100 
gggctgccct ggtcctgtta tacccccaac acattctaca gtcagctccg caagttccac 5160 
aaagatcaac gctggcgttt ttatggcatt ttatttacag tttttacaat ataaaaaagg 5220 
aaggatgcca cagctcagcc agcaggacag acagagatct atgatgcttc tgctgcacca 5280 
ttgtttgtgg tcaagaaagt ctgttttcaa tgatttatta aattgtggtg ggagatggat 5340 
ggtggcagtg gttaccagca acatgaatgt tcttaatgcc actgaacttc acacttacaa 5400 
atggttacga cgataagtgt tatatgtatt ttaccacaat taaaaacagg taaatgcagg 5460 
ccgggcacgg tggctcacga ctgtaatctc agcactttgg gaggccaagg caggcagatc 5520 
acctgaggtc aggggttcga gaccagtctc gccaacacgg tgaaactctg tctctattaa 5580 
aaatacaaaa attagccaga tgtggtggtg catgcctgta atcccagctt ctcaggaggc 5640 
tgaggcagga aaatagcttg aaaccgggag gcagaggttg ccatgagctg agattgtacc 5700 
attgcactcc agcctgggtg acaaaagcaa aactctgtct caaaaaaata aaataaaata 5760 
aaaataggta aatgcaaaca tatggtatag taatattatg ggctattatg agctacaaaa 5820 
aagaatgact tgggactaca gttacagccc tcattcagga atttgtttta aatgtgggtt 5880 
ggtcgctaag gcatgtacac aacattttga cgttcaaata ttcctagatt tggacagtga 5940 
gcacccctct aagctggctc ttctgtccca gaggtcccca ccagtcctcc agaacttctt 6000 
tgctttctta cacaataaga tgccccatgc tcggcttgta cctttccttg ccccagccct 6060 
agaaccagct tcttcgtgga caagctctga ctcctttggg tggagaatgg tattcagaaa 6120 
cccagaccte ggctct£gt£ tgctcactgc tacttggggt cattgcttct aggcctctct 6180 
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gctgatggag gtaggatata cacgtacagt cttccctctt cccagattcc gtacttgagc 6240 
tcgcctactt gctaacattt atttatatcc cccaaattaa acctcacagc acttctgcaa 6300 
tcactcactg acttgcagag tgtgaaaaaa ctgagtcacc atcacacgtt ccaaactgag 6360 
gtcaactgag gccacaacgc cccatcttct tgctccggct gtcgagatgt aagcaagtgt 6420 
ccttctctcg gtctagctag tgccatgctt tccacatcac tgtgcttttt gtgggcaatt 6480 
ttgctgtata aaatgtcccc tgcacatatg ctgctgtgta gtgctcctag gtgcatgagg 6540 
ctgccccacg ccttacagag agaatatgca tgagaggctt tattcaggta tgagttatag 6600 
cgtagttggc catgaattca atgttaatga atcaacaata tacagtaaat aaggtgcttt 6660 
ttagagacag ggtctcactc tgtcacccag gctttagagt ccagtggtgt gaccttggct 6720 
cactgccgcc tcaacctcct gggctcaagt gatcctccca cctcagcctc ccaaactgtt 6780 
gggattacag gcgtgagcta ctgcactcag cctaaataag gtgtcttaga aacacacata 6840 
agacaaggtt atgggctgag tgcggtggct catgcctgta atcccaacac tttgggaggc 6900 
caaggtggga ggttcacttg aggccagaag tttgagacta gcctgggcaa catggcaaga 6960 
cctcatctgt atattttttt aaatcagaca ggtgtggtgg tgcatgccta tagtcccagc 7020 
tactggagag gctgaggcag gaaaatggcc tgagcccagg aggtcaaggc tgcagtgacc 7080 
catgattgta ccactgcatt ccagcctggg gtgacacagc aagacgctgt cttaaaaaaa 7 140 
? aa aaaaaaa aagccaggtc aggtatcgaa cagttggcaa aaacgttgtg acctgaggct 7200 
cacaggaacc tagcccgatg tttcccctag gagcaatggt tcagtattca ataattcagg 7260 
gttcccagtg actttatgga gcataacttt caagaataac aagaaccaac tgtacgtgtg 7320 
tatgtatact cacactttta ttttatttta ttttattttt tgagacagag tctcactctg 7380 
tcacccaggc tggagtaaaa tggcgtgatc tcgactcact gcaacctccg cctcccaggt 7440 
tcaagtgatt ctcagcctcc caagtagctg ggattacagg tgtgccccca caaccggcta 7500 
atttctetat ttttactaea eacgeaettt csccacattg gccacgctee tctcaaactc 7560 
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ctaacctcaa gtgatccacc cacctcagcc tcccaaagtg ctggaattac aggcatgagc 7620 
tgccgtgcct agcctacata cacttttata cacacatgca tctatgacta tttctctatt 7680 
tctgtgcatg tgtgcgtggc agtacctaca gtttcagcta tgtgtctggg tactgtctcg 7740 
tccaagtttg taagcacctt ctccaaagtg caaagcctgg cttgtgttac tatccatatg 7800 
tttacttatt tgctcaatca atttacttat tagctccata accagcttcc catctgctcc 7860 
agtagcctct gctgtcagtc acctctgcac cctaccccac cttgcttccg gatgctggat 7920 
gccaatcacc cccgacacct ctacatagca ccaccctcga catgctgctt ctttatttct 7980 
tatttatttg tttgagatgg agtcttactc tgttgcccag gctggagtgc agtggcacga 8040 
tccaggctca ctgcaacgtc cgcctcctgg gttcaagtga ttctcctgcc tcagcttctc 8100 
aaatagctgg gattacaggt gcccaccacc acgcccagct aatttttgta tttttagtag 8160 
agatggggtt tcaccatgtt ggccaggctg gtctcgaact cctgacctca agtgatccac 8220 
cttggcctct caaagtgctg ggattacagg tgtgagccac cgcgcctggt ctgcttcttt 8280 
aaatgccagg caccaacatt tgtgcaatgg ggtgggagga aagaacaggg aggagagcac 8340 
actgccggcc cctgcactga atccactgat caatctgggg gcaactgcca tctccatctc 8400 . 
ctgtcttcct atccgtgaac atctactgca gtcctctcca atgtccttct gtaaagttgt 8460 
attatgtttt gcatacaggc cttgcatatt agttctcaga tataatccat atactttata 8520 
taaaattcaa accacattta aaaaaataaa actagcatga ctataacgga gtctgcaaca 8580 
ttctcacaga ctttatgata aaacatgaaa cttcaaagat acttagggtg gggcagggac 8640 
aatgtttaag gctgcctgga agcctcccca tccctgagcc agaaagtcct atctcccctt 8700 
caaggggaaa tgcttgaaaa agcactgatc aggctaaaat gacagggatc agggagtaat 8760 
caaagtacaa gtgagctggt ctcctccatt ctgagcacag caaagttcag tctctccaag 8820 
tccaagaatc atacacctgt ttgccaagaa tgaagttcag gtgtctacaa gtggctgaaa 8880 
atattcatte ctsssccatt aacaacattc ttsscaaaac cataccttae cttctcfftss 8940 
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aaatttctta aggtagaaga aacaggaaac acccaggctc gcttttatgt agacagttcc 9000 
atgaagccag ggaccttccc cacatccacg tttcaattac ctgcacgcag ctcacagtgt 9060 
attcaacatc tacgcgtctc tcctactggg gtggcggtgg ccactcaaac cctcatgcag 9120 
ctacgatgac cgcaattttg gcaacataat ttcatgtttt tccttgggct tttacccaag 9 1 80 
tcagtgacac aattctgcag ttgtctaaag attcaaaatg agggacttga catttacaac 9240 
aataataaaa tcttgggttt cctttaacca agcacatgtt ctgcctttta gagaaagctc 9300 
tgcaaactca agctggagtg ggatacttgc tgacatcttc aagcacccca ggaatagctc 9360 
tactccccca tttccacctt ggctgaacca tctatatccc accaattccc ccaacatccc 9420 
tccatccgtc catccatcca cccaaggacc tgctaagcca ggaggtctct cccatctacc 9480 
ccacagcctg gcctcagccc acaagggctc tctctacatg aatcccaccg caccagagta 9540 
gaccaagtct cccgtagact ccaccctgac cacctccatg cctccagcca ttcccacccc 9600 
taaaaaccct ccctggtctc tacacccagc tgatgaatac ttggctgaat gtgacctggc 9660 
ctcctggacc caggtgaagc ccacgtcctc cgtaagcccg ccagctcacc ctgcctctgc 9720 
accttcactg gagagagccc gcacttcacc tcctcagggc aggcatggct gatgccaccc 9780 
agtggaatct ggtgcaaagc agggcccggt gcagagcagg gctgcctgca gagcaaggcc 9840 
ctggtgctgg ggccgagcac ctccaatgct ggccgtggaa ccatccctcc cattccaggt 9900 
gctgtctcca tcaagaatga gcgagctgct gacatttgca tgacaataat gaataaatac 9960 
catattttgc ttcaaatcca gaatagatgt ggccagggtt ggcatatgac tgttgggaaa 10020 
ggacagtttg cctcttccca aaccaacttg gattataaaa agcttttctt aacgaccaca 10080 
agagcggagg agctcagggg cagacaaaag gaaggctggc tgcagaaggc gggagagtgg 10140 
ggccttcagg ggcgggtggg gagagagaaa gcctggagct gcacccccaa ggtctgtgta 10200 
catcaggtgc tacagaataa caccacctct tccagcttgg cccccacctg ccctctccca 10260 
ecccaetcac ccaeacasca ccccactccc cacacacacc tcacatctsc ccecctcaca 10320 
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ctcaccagct tcggctctca atgcaacctg gaacctgccc ttggcctctc agctcagcca 10380 
cccccattcc tgttggcccc tggcccccca tegaattctc tctaatccta atgcacacac 10440 
ttgcacactc aaacacacac acacacacac acacacacag cccagaggaa aaccataatt 10500 
gactgaggtc caggcaagtt tcccgagcag ggaccacatt tcaaaggtca gggaagcagg 10560 
cgaacaggaa acatacaggg ggcacgtttg ggggtggagc aggaaataag aaatcacttg 10620 
caaaagataa aaagaaaatg aggtagctgg tttcagacac ctcggagcac acagaacagg 10680 
acaggcgcct ccgggtcttc cctcaacagg gagatgggcc aggcaggtcc ctgctgctcc 10740 
accgcagagc tgggggctat ggccctgaca ccaaggccct ggggcaggcg gggaggcagc 10800 
tgttctcctg cctgtgctcc cgggcagggc ctggccccac aagggaactg gccgaaggct 10860 
ctgcttggct actccggaaa gtcctgggag acaagcaaag gacttgctag gtcactccaa 10920 
acggcccaga tgtgacaact gtgaagaagc cacaccaaag caaggtgaca gaacaatgtt 10980 
ggtgacgtca ggttatcagc ttacgctcaa ctccacttac ccggactcac ccgtaacctg 1 1040 
ccgtctcttc ccaaccagta aaggatgcct aggtagaggg gcacaaggcc tggagcataa 11100 
ttaccatttt aaaggctctg agaagtcctg cggtgaggaa gcctagttca ctttctctcc 11160 
cctaggattt cccaactgcg cctgatcaca gaacattttt tcatttecac tcaggaaaca 1 1220 

tattttgaaaaacactggcctagaggcagaagtgaaatggaaaacacaaaagtaaaactg 11280 
aacaggaggc actgggcaga gaacggtcag aggcgccctg aatcctggac cggtggagat 1 1340 
ccccagcttg gcatgctccc ctccctgggc ccagaccgcc tccccccatt tcctggataa 11400 
gaaggctaat gcgcatcagg gtgaagggct tgcctgggct acacccccag gctcgcccca 11460 
caccaatcgc gctcctgcga gagccagtga ctttcttgat ttggctactg tggaattgtt 1 1520 
tgcaactaac caccccagat acagatacaa atgacaggat gatcagatgt aaaggaccca 1 1580 
caggtctctg tgatacggct tcatgcagcc agcatggcta gtgccgtgca gaatgagaat 1 1640 
gaccccaggc aagtccttgc ctcccagacc cagaacccca tggagcccac cagggctggt 1 1700 
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tcacaagcac tgtctgggtc gggcagagat tccagcaaga ggagggaaca tccatgcacc 1 1760 
ggagccagtt accagaagca aatcgcctct tccaaaaccc aggctattaa tggagtccac 1 1 820 
tgttgagtgg agctggggtc tagctatgga atactgcaca gcagagatct tcctgagaga 1 1880 
aagcagtttt ccctgaaagc catgtgtcct ccactaactg tgttttaatt gggcgaacgt 1 1940 
ctgtatctca ttgcagtggc cgcgcatgtg ctgacaaggg gctgggggcg gggtggggag 12000 
cagaagctca ggggcctggg agggaaggaa acaggccacc agggctcccc agaaggcatg 12060 
tatctctctc acaaacacac gcatgcacac acacgtgcac acatactctg caagccctga 12120 
gttagcaact gtggaatgtg accagctcag tgatcccagg acaagctgct agggaatatg 12180 
acatttgatt gatgtctgca aatgtgcgtt ttcactaatt agaaggttta gggcagagca 12240 
gagaaaaata tgtatttcag agtcccagtt tgacctgcca gaaaccagcc cattactaac 12300 
attcttattt tcaacaaaat atagcattct gattacatac catcttggtt ccacgcctcc 12360 
tgccttgcca agcccccgga agcggcccaa ggccatggca aatagtgaga gaaacagttc 12420 
cagggtggag actgactcag gggtgtcagt cagtggggcg ctgatggccg gtgggaggcc 12480 
agcagtcatc accctctcct tgggacagtt gagtagctct cccccagggt catgtggcca 12540 
ctcaggttca tatgggaggc gagaggagtg gcagagtcca ggagagtggc tccgaagtca 12600 
ctgttccctc caggcctcag tgtcttcatc cattaaatgg gtaggctgag gtctgggatg 12660 
acaaggaggg cttgcactta ctgaaaccca tgggaggctg ttcgccgatt tcttttattg 12720 
atggaagaaa acactcgtat aattcaagta ccaattaaaa ggcaggcact ggaaccaccg 12780 
tctgccaatt cctagttttg cctataccaa atttgagcaa gttaattgac ctctcccagc 12840 
ctcagtttct tcgtctgtaa aatgagggta gggatggccc ccagcccaca gggcagctgg 12900 
aaggattaaa gaaatcaaac atctcttaga gcccacctgg cacactgtga tacacaacaa 12960 
atgttagcta tttttgtcta tgaagtctag attttatatc ttgggtgttc taaagcagga 13020 
tacatttatt taaaaacaag gattttcatt aaacacgtac_cccacagaca gcaaccccat 13080 
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ggagactgct cttaattcag gccagtatcg aaacgactct aactacaagc tttatacagg 13 140 
tctcttggct gtccttcaaa tccaactaag gtggtacttc tgaagcactg tgcacatgtg 13200 
tgtgtgcatg cacacgtgtg ggaagggcgg gctcacggat ccctcaggta ccccacccac 13260 
gcagtctcaa gtcacaaagc gacagagcag ccgaggaagg tctgtgcccc actggaccct 13320 
cgtgaagcca ccaactctac ctctgcgccg tgtcctgcag actgggctac cctttgggtg 13380 
gggaccagca tttgatgcaa gaaaggcaga cagaaaagga aaagggcaag ttcgactcca 13440 
gataacacag acagtaccaa gccccagggt ccataaatgc cacgcagatg gaagcattta 13500 
ctgcgaggcc acacagcaaa cgcacggatc cagggacgga ggtgcagact gcggtgcccc 13560 
tgagccatga ccctgcaaat taccaccatg ggaaaggagg ctgccaaacc ccccgacagt 13620 
cggctgggct ggcacagact cgtggtttcc atcgaggtgg gaggaggtgg gacgtcccag 13680 
cccctccccc atgcccactg cagagggaag cggccgtttc ccctgtgtgg ttacaaaggt 13740 
ctcattgttc ttcctcacag ggaggaaact ggaggaccga gctcagaacg cattttagaa 13800 
ctggcagaaa agaacatctg gggaaggaaa cacatttcag aaacaaacat acctttgtac 13860 
cagcttttat tttctttaag tgttgaaaaa ataataataa taaagacatg ccaaatttat 13920 
catcgctcta caaaatccct ttattgagca aaacgtggca gctctacttt caaatgatta 13980 
ctgttcctgg aaaattgcag caacgtggat gccaaggccc gaaggccgcc atcagcagcc 14040 
aaacaaaaga tgccacctcg ggctccgcga cactgtacca tgccagggaa ctggaca'gat 14100 
ttggggaatg ccacggtttg cctttaaccc cttgcctcct ggtctcctga tgcatctcag 14160 
aggctaacat tctttgagga actggcattt cttagttgta aatatgcatg tgggtttggg 14220 
agctgcctgc aaagtccagt gttgacgatc agctttgatt tccttggaat caagtttacg 14280 
tgtcgagtct ggaagttaag aagaatttgg agaagctgag cactatggtg ttgcaggccc 14340 
tgggtgaact cttccaccaa gcattcattg tggactgaca gcgtgcgagg ggctctgcag 14400 
ffcaestecac aesacsaaac acattccstc ceeessaaac ctecassaaa ectccctctt 14460 
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cttcctaagg tgccgggcct agcttcatgg gtccctaccc tccacgcctg tcacactttc 14520 
tgagtctcat gtgggagctg cttctggttc ctgacttcac tcagtcctca taggaggtgg 14580 
aactactgtc accccatttt acagatgggg agactgggca caaggggacc aagaaaccaa 14640 
tgcaaagtca cacttgtggg atcagtgaca ggggagatca attcccaggt tctttctgca 14700 
agagttaaat tgttttcatg ctgcctaagg gggggcaact gaaagaccac tgcatatctt 14760 
tgccaaaagg gtcaagcaca ggagccgcag ccagtgggtc agatccgcag aggcgctggg 14820 
gtgaccctcc ccatacctgg agggatgctt gtcccctcct ggccttcact gggtcccctc 14880 
atgaccgtgg cctcccagga cctcagcaca atcccggtcc tgtgctccag gacaagccct 14940 
ccgtccccaa gactgtgagg aaatggaacg aagaggggct cgctgcagcc cagcacccac 15000 
actgcccctt ctcaggggca agaaccgtcc tggaggactt ggctttggag ggggagcctg 15060 
ggaggccagt aagtcaacaa gcctctactg ctcatgggtg ggatcccacc gcaggccccc 15120 
acctgctggg gcgggcaggg acgggcggca cagcttggcc agggcagata acccccacct 15180 
tggccagggc gaaggcagga cacgtgggct ccagcctggc cccaccatcc ctgcacaaca 15240 
ctgggcaaag tccacgtttt cctcaactgg gtgttgacat ctgcaggaca ggggcatgga 15300 
ggtacagagc gctgaagcca cacagcaacc taggagcgag actccatgcc tccccgggga 15360 
cccctcccca ccatgaggac catgaaggct tcccatgtgc cgcaaggact ctggtgtgga 15420 
gacacacgtc tcctacacag ccaggcctaa cgctcttgta actgggtggt cccacctggg 15480 
ctcacagctg gagggccagg agctcaaggc ttcgcagggt ctgctctcat cccagaggcg 15540 
atggggagcc acagcaggct gcaggagaga gggtgggccc cctccacttc agaggcccca 15600 
tctggcccac agactggaga gcacatctct cagcaaccac ggagcgccaa ctgcgcacag 15660 
ggcctggtcg tcagagcggg gcaaaggcac tgaccgtcac ggccagggcg agggaagacg 15720 
ggtgggcagg gaccttgggc agagggggaa gaacctggtg cccaggctgg ccctgccttc 15780 
aecagtgaag ctgagtgggg aggcgctgat gcagggggcc agaaagggct gctggtcagc 15840 
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cgggaggagc cccccacaga ggaagcagcc agcccagacg cagatggcag ggtcccctca 15900 
acaatgtcct ctgaaaagga gaggcgggga ctgctctggt gacacctaca aatagatagt 15960 
cagccctcag ccccctgcca tacttctgac aaagcagagg cccccagggg aggcgcaccc 16020 
gaaggtacct gcacctgtcc cccagactcc tagagcccac ctgaccccat cccaccaggg 16080 
ctccagctac aaaataaatg ccgaggccag ctaggcaagg acgcacactc ggtaccgact 16140 
gaataggctc cacgttgtca tgagcgcaac ccacaggcca ccaggccaca ctatgcagag 16200 
ctgagatggt ttcggccaag cagcctctca gctgagctga acaagtccag agtccccggg 16260 
gggtcgtcactatggagtaacaattgcgatgcgatggtaaccctaacagctaacc^ 16320 
tgagccaggc cctgagctag gtacttttca acgctgcctc tctgcagcct caggacgagc 16380 
ctgtgggagc ataaagatca ttccctatca cggatgggga aactgagctc tgaagcagtt 16440 
aacgtgcttg tcccagaccg cagagctagg agcaggacac aacagcaggt caggcaggaa 16500 
cgggtgaggg gggcctgcat gggcttctct ggaggctgcg catacacgca acccccagga 16560 
ccccgaccctgcacctgcagctcgctactgccccctcagtgactccagcaaa 16620 
taggggaagg aggctgggaa tacctcgggt gtccgaaaca gcagcttctg cttggaggcc 16680 
actgctgcat aatggttgct gcccagcaca ccccaagcca cctgtgccac ctgtggtgac 16740 
cttccagcat gccttggtga ccaagctggc cttaggtgct gtgggcagcc aagaatagaa 16800 
cagggcccac ccctcctctt cacactaaca caaagcaaga ggcgggcact tcgactgagt 16860 
gcatccctct agctcaaggg cctcacggat cacaggggtc agggcaagat cccaattctg 16920 
cattcccgtc tgcctttcat cctgctctgc caacaacagc cagtgaggct ggggacatcc 16980 
ctgaacctgt ttctcacctg aaacacatca taccattgga ccccagccct ccgggagagg 17040 
ccctaatccctgac^gtggtgagatcagatcactggttaagtacccagaagggccttgg^ niOO 
caggggctcc agg g gtgg gg ggtgatgggc gtggtggtat cccgctctgg gctatagtcc 17160 
accctgatgg aggaggtctg tggtcagaac cgggctgtgc agggcacagg agcccagagg 17220 
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gacccccaga gctcacctgg tggtctctga gcagggctcc ctcaaccctc agagaaaagc 17280 
acagcaagga ggccgcccag agcccagcgc ctagcaccca gtggcgtgcc agacctgcct 17340 
ggatcctgga gatctctcat caccctccaa gtcagtcatg cccaacccag ggacccacag 17400 
cccacggggc cgtgaaggtg tgctgagtcc aagaaggcct tcgacactgg gaagccaagt 17460 
ggcacctcct ggtgtggagc aggcggaatc ccaccagcct ctgctctgcc agtgggcaca 17520 
gctggacgat gagcagaagg ggctgttgct taataaacgt catttcctta agaggataaa 17580 
acctttcaaa acagatggaa attttttttt aattaaaact ggtggccaaa gagatggaaa 17640 
gcaccccttg tgcctccctc ccatcgtgac ccatcctctg cacacctcaa gctgttcgct 17700 
gcccaggtgt ctcctgaggc actgggggcg ggtgagaatc cgtgagccct cggccagccg 17760 
tggctctctg gagctctgcc ccaggccatc agggcacacg ccgggcaccc tgggggccac 17820 
acagggcaga gcccagctgg gtcagcacac agggccacac tgggcacaca agtctctgag 17880 
cctcccctgt ggacgcagct ctcactatcc caccccacta ggtcccgggg atctgtccca 17940 
cagggtgata tgctgtcaca gaccactacc agagccatgg cctgctgttc cgcccgcagc 18000 
caggtagtca cttgctccac agggacaggc aacgccgcac ttgggggctg ctctgcggca 18060 
ggactagagc tccagcagct cagccctcct gagaaggaga actccatgct ctaagaggca 18120 
gacgcagcgg acggcaccaa agccaccaca agcccacggg gccctgcatg gcaggtcagg 18180 
agtccctgac cactcgctct ttgtaaccag agctgcagtg gagtctacga ggcaaggact 18240 
gtgggcggca gtggccacag caaatgaatg agtgtcccaa gggagcaggc ggctgcgggg 18300 
aggcacagcc gggacccagg agtcctccgg cactgcagca aactccctgg gccccctgag 18360 
cagcgaccag gtggcaagtg catgaactcc cgggggcata acctgggagg gtgacactct .18420 
cttcgtgttc aaattcttga gaacgcatta aaaatatcac tcagtcacct actctatagt 18480 
tttaactcaa aagtaccaaa gtagccaggc gcggtggctc acgcctataa tcccagtact 1 8540 
ttgggaagct gaggcaagag gatcacttaa gcccaggagt tccaaatgaa cctgggcaac 18600 
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at ggagggac cccatttcta caaaaaaagt gttttaaaaa attacctggg cctggtggtg 18660 
tgtgcctgta gtcccagcta ctcaggaggc tgaggcggga gaaccacatg aacccagggg 18720 
aggtagaggc tgcagtaggc tgtgatggca ccactgcact ccagcctggg taacagagtc 18780 
agactctatc tcaaaataaa tttaaaaagc accaagccag gcttggtggc tcacacctgt 18840 
aatcccagca ctcagggagg ctgaggcaag tggateacct gagtcagaag ttcgagacca 18900 
gcccagccaa catggtgaaa ctccatctcc actaaaaata caaaaattac ccaggcgtgg 18960 
tggcgggtgc ctgtaatccc agctactcag gaagctgagg caggagaact gcttgaaccc 19020 
aggaggcaga ggttgcagtg agccaagact gtgctactgc actcaagcct gggagacaga 19080 
acgagactcc atctcaaaaa ataaataaat caatcaaaac caccaagact ttttaatata 19140 
aacatttatt attccataat tccttttttg catgattaaa aatgtttata taaagtttcc 19200 
tgaaaatggt aagaatgcca agtgaaggct gcaaatgccc aagcccccac cgtggcatct 19260 
cacggagtct gggccctagg aggctggtgg gtaccacgtg gacccgagac ttcacagtca 19320 
agtccctttg gggtacactg ggtttcccac accccagaaa tatgggctct tactgcagga 19380 
ccatgggggt cctcacactt ggcccagaag ctgtcacata gccagacagg tgttctacaa 19440 
cctaggctag agggagctca tgctccagca gaattcgagc cagaggaggt aaaagatggg 19500 
taagatctgc tccctggaca gatgaggcct tggcctcaga acagttactg atcatctacc 19560 
agacatcaca ctagaggcag aggggcgcag acgaagacag cccctgtcct caaggccctc 19620 
ccaggttggg tggaccatgg aaggttccag acagatctgg caagagaagt gcccacacca 19680 
ggggcagaag atgggcaggt ctgctcaggg cggcacggcc tgccaggcca aaaagttcca 19740 
acttcagatg ctggagaatg ggcacgactg tctgagaaag ggaaggatgt gatgaaaact 19800 
acttggagaa aaattaatct ggccagagca taagataaat gggcaaaggg gaggttccag 19860 
aaagcaagga gaccaagtaa aagctgatgt cattggctct gaatctaggc tttcactgaa 19920 
tatgcaccgc agggcctgta ggtaaagcct cagagcccag ggagtctgag tggaggagag 19980 
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ggcaggggac agagctgggg cctgtgtcta cagtgctcag gaggaatagg catggacgtc 20040 
agctcggagg ctccagctga agtgaggagg cggccagggc agcacggcca cgcccggatc 20100 
cagactcctt ttgggaagca agttcgctct gggggaaagt ttggagaaat ggcctttacc 20160 
cgcagaagca agccccagaa catatcttgc tccaaaacta tctcgtacag tgaggacgtt 20220 
aagctteagg tcccctagag gagacagtct gctccttcct ggggcagaac ccaaggtggc 20280 
cagagcctgg aaggcaccca gcacccaggc tggtgtgttc cagcccaggc cacacgctca 20340 
gatagctatt aatgccccgt tgagcaattt cctgagagct ttgccaggca ggtaccgcct 20400 
ccccatctga actaatacag gggtacatcc caaggaagaa atgaaaggtg cccacatttt 20460 
gctctgggat taactaggga ggggagtgat aattaactca gtaattatat ttgccatcgg 20520 
gctaatgcta aaattagtgt gcattagaat ttctttcctg agcagacacc ggagtgagtt 20580 
gggcagcagg agtggctcgg gcaagtcggc acaaagggca cctccagagc cttccacaaa 20640 
tgtcagcaaa acccacaaat gtcaaggccg gctccactgc acccagcaga tgaattcact 20700 
tccacagcct gagaccgcca gctcatcgga ggccatttaa aatccagccc tctgacacct 20760 
gctggatatc accatttacc gtccccagat caagagatca aagggtggaa cctgatagga 20820 
cggctctgaa gttcaccaca aaagcataaa cgtgcaagca gagccaatac gtcttttgaa 20880 
aaggacaatg aggtgggaat ttacataact gatcttaaaa tatgttctga tgcttcagag 20940 
atggagacag cagcattccg gtacacaaag acactcacag gcagtggagc acagtgaagg 21000 
gtctggaatc aggacccagg tgtctgtgga cactacacat aaaagagcag catttacaat 21060 
gaatggatag gatggaccat cccaccaagg tgttggacaa ctccctattc actggccaga 21 120 
cccctacctc ataccatata caaaaaaaaa aaaaaaaaaa aaacccagac agaataatgt 21180 
ctgaatgtaa aacataaaac agtaacagtc ctggaagaaa ataatggagg atatatttat 21240 
aatctggaga tggagtaaca agggatagga aaaaagccat agggaaaaag tagagttatg 21300 
attatatgaa gcttcttaat atctttatga taatgtacca ccagaaacaa ggatgaagga 21360 
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ctagctacag accagcagtg aaacctgaaa caaacagaac aaagaattaa agtccatacc 21420 
aaataaagac ctcccacaaa tctataagaa aaagataaac aggctggcac cgtggcttat 21480 
gtctgtaatc ccagcacttt gggaggcgga gatgggtagg tcacttgagg tcaggagttc 21540 
gagaccagcc tggccaacat ggtgaaaccc tgtctctacc aaaaatacaa aaattagcca 21600 
ggcgtggtgg cgcatgcctg tagtcccagc tacttgggag gctgagccag gagaacagct 21660 
ggaacccggg aggcagaggt tgcagtgaac caagatggca atcgcgccac tgcactccag 21720 
cctggaggac acagcgagac tctgtctcaa aaaaaaaaaa aaaagaagaa gaagaaaaaa 21780 
gaaaagaaaa agacaacaga aaaatgggcc aaggataagt gtaggcaatt tgcagaaaag 21840 
taaataccaa taaaccagaa atgagggttg tgcaaatcaa aaggtgttat aatttttaac 21900 
caaactggac caaagaaaac accaaaaacc aaaatcttgt aattgccagc atcagagagg 21960 
atataggaaa gtgtgtgttc tcgtagatgc ttgcaggtat gaactgctac agccttttag 22020 
gagttatgta tgtatgtatg cttgtatgta tgtatttgag acagggtctc gctctgttgc 22080 
ccaggctaga tctgttgcag tgctgtgatc atggcttact gcagccttga cctcctgagc 22140 
tcaatagatt ttcccacctc agcctttcaa gtagctgaga ctacaggagt gtgcaatcat 22200 
actcagctaa ttttttaaat tttttgtaga catggggggt ctcccaattt tgcccaggct 22260 
ggtctcgaac tcctggactc aagtgatcct cctgcctcaa cctcccaaag tgctgggatt 22320 
acctggatga gccactgtgc ccggcctcaa tatctttaaa aacagaaatg gacacactct 22380 
ttgactagga atgtatccta taaaaacact tatacacatg cagagacaca cgagcaagca 22440 
tgctttgtaa tagcaatgaa ggctggaaaa actcctcaat caggtaaatg ctgtcaagtg 22500 
cacctgtgta ctatgaaatg gcacttggct tttaacaaga gcaaagacag aaaagcaaaa 22560 
gtacaaagta gggtgtgatg gcacatgcct gcagtcccag ctactcagga ggctgaggca 22620 
ggaagatcct ttgagcccag gagttggagg ccaggagctg ggcaatagtg agaaaaaata 22680 
aaattaaata ataataataa taaaataggc tgggcacagc ggctcatgcc tgtaatccca 22740 
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acactttggg aggctgaggt gggaggatcg cttgatccca ggagttcaag gccagcctgg 22800 
gcagcaaagc aagacaccca tctcaacgac aaattttaaa aaatcagcca ggcaggctgg 22860 
gcatggtggc tcacgcctgt aatcccagca ctttgggagg ccgaggcagg cagatcactt 22920 
gaggtcagga gttcgagacc agcctggcca acgtggcaaa accctgtctc tactaaaaat 22980 
acaaaaatta gctgggcatg gtggcagatg cctgtagtcc cagctactga ggcacaagaa 23040 
tcgcttgaac cagggtggca gaagttacag tgagccgaga tcgtgccacc gcactccatc 23 100 
ctgggcgtga gtgagactcc tgtctcaaaa aaaaaaaaaa aaaaaaaaca aggagccagg 23160 
cacggtgggg tgagggaggg cacagaagca gcgcctcttc tgggggcacc cccaatctct 23220 
agcgatccag aggcctcagg atcctgaagg gagaaaaaac gtgaagctcc gtgctagaag 23280 
agaccataga gattggaatc agctggttct attttacaaa aaaaggaaac tgaggccctc 23340 
agaaggtgag tgcctctcaa tgccccacag ggaggcaggg agagggctct gagccctgca 23400 
gggccctgga ttcttgcaat ggggtggagt ggagcctgtg ccgcccccac caggcacctt 23460 
ctcaggagag gagccgttgt catatccttg aaggggtcct tgagcccctc aaaaggctaa 23520 
aaaccacttt cctccttgag tgaaccttca cctcagttta accacaagaa aaactacatt ' 23580 
aaggcccagc gcagtggctc atgtctgtaa tcccagcact ttgggaggct gaggtgggtg 23640 
gatcgcttga gcccaggagt tcaagaccag cctgggcaac atagtgaaac cctgtctcta 23700 
caaaaaacaa caaaatcagc tgggcgtggt ggtgcacacc tgaggtccca actacttgcg 23760 
ggctgaggtg agaggattgc ttcagcccag gaggtagagg ctgcagtaag cggtgactga 23820 
atcactgcac tccagcctca gcaacagagc aagactcaaa aaaaaaaaaa aaagcaggcc 23880 
gggtgtggtg gctcacgcct gtaatcccag caccttggga ggccgagcgg gaggatcagg 23940 
agatggagac catcctggct aacacggtga aaccccgtct ctactaaaaa tgcaaaaaat 24000. 
tagccgggcg tggtggcggg tgcctgtagt tccagctact caggaggctg aggcaggaga 24060 
aaggcgtgac cctgggaggt ggagcttgca gtgagctgag atcacaccgc tgcactccag 24120 
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gcttctcatc ccagcacagg tagggggtgctatgggaaag ggatcctcag ttggccctgt 24360 
cactgctcta tcagctgggg acgtggcatc ctagtgaaaa catcatggcc gggcgcggtg 24420 
gctcacgcct ggaatcccag cactttggga ggctgaggag ggtggatcac ttgaggtcag 24480 
aagttcgaga ccagcctggt caacatggtg aaacccatct ctactaaaaa tacaaaaatt 24540 
cgccaggtgt ggtggcgggt acctgtaatc cgagctactc gggaggctga ggcaggagaa 24600 
tcgcttgaac ctgggaggtg gagcttgcag tgagccgaga tcttgccact gcactccagc 24660 
ctgggcaaca g a gtgag ac g ctgtctcaaa atctcaaaca aacaaacaaa caaaaaacaa 24720 
acaaacaaag cgtcatttat ccagcacccc tggggaacca tgctacctgg tgttttatgg 24780 
tacctggcaa ggtgcaggtg aagttgctgc tcttgggcat tgaacccgtc ttgtttgggg 24840 
cagctcaggc cccaggcagg gtccgggttg gctctcgttg gtgtggccct ggcccatcca 24900 
gacctatatt tctgccgtcc tgcaggtgat caatgttgat gggacgaaga ggcggaccct 24960 
cctggaggac aagctcccgc acattttcgg gttcacgctg ctgggggact tcatctactg 25020 
gactgactgg cagcgccgca gcatcgagcg ggtgcacaag gtcaaggcca gccgggacgt 25080 
catcattgac cagctgcccg acctgatggg gctcaaagct gtgaatgtgg ccaaggtcgt 25140 
cggtgagtcc ggggggtccc aagccatggc tcagccatgc agacttgcat gaggaggaag 25200 
tgacgggtcc atgcctgggc ataagtgttg agctcaggtg ccccgacctg gggaagggca 25260 
ggacaggaaa ggtgacagta tctggccaag gacagatggg aagggaccaa gggagctgat 25320 
tagggagtgg ttatggacta ggaatgtcgg taacaatggt tagaaagtga ctaacatttg 25380 
ttgagcacct gctgtgtgcc cggccctggc cgggagcctt cgtgcccaca gtgaccccgt 25440 
ctgcaaatgt agttccttgc cctactcgca ctggggagca ggacgcagag ccgtgcaact 25500 
cacaggtgcc aagctcagga ctccctcctg ggtctgcctg ggctgggctg tgcttgttgc 25560 
ccctgtggcc cacgcatgtg caccttccac ctgaaagcca ggatcttcag gacgctcccc 25620 
gaggaggtcg ttgtctggca caatgatttg tctottcctj^aaaaggtgac agagttacac 25680 
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cctgggcgac agagcaagac tccatctcaa aaaaaaaaaa attaaatctc aaaaaaaatt 24180 
acattaaggc aaactaaaag atgtttaaaa tatatatatt aaattaaata cactccaata 24240 
gagcaaatac gaaaataccc agaaaacaca atccccgcac ccccaggaca acctcccagg 24300 
gggtccacag caagagaccc caagcacgag agacagagaa cagtgtccct gtggcggaac 24360 
ctctggccca tcaggctcta ttagaaaata aggctcttgc cactgagaga aagaggcaca 24420 
gtcgcccagc agccacgggc tctggcacac cacgagtcag gccagcaaag tgtcaactgc 24480 
cccctacaag gtgacaaact aggacaaact ggaaaccaga ggctggacct ggagcacagg 24540 
gaccaccaca tggggctggg gaatgggcag ggacctcaga gcgccaccca catgcctaag 24600 
agcagcgcgt atgcgcatgc ctctgcatgg cttagggaca cagggagctc cccccacccc 24660 
caacccagga aggcagcccc cactacccag gtagggaacg gataggacca gcaccccgtt 24720 
ctgctcgtaa ctcagggctc caggccccct cgggggcaac cagcacagag ctcagacccc 24780 
aaatatcttc acccacctcc tggtccccat ctggacaagg gtgctgggga ctggctctca 24840 
gtcacaccct cggggtactc ttcaaaggac agctggatgc cccagggcag gagcttttgg 24900 
cccccagctc cctcacccca gacaccagct cttgggaccc caccagcatg ggcaaggtgg 24960 
acaccatcgt cccgattttg cagatgagga aactgaggct gagggctggc acacggctct 25020 
ccagagctga agagaatgca gagagcagcc ggagccagcc ggtgggtccc tgaggccggc 25080. 
tcgtagcaag ccacagctgc ctccgcccat cacacttgga cctcactggc cccaggacag 25 140 
ccctccaggg cggcctggca cagagcccac accctgctgc ttcctgaaca aataagtgaa 25200 
caaggccacc aagccgagga cctggatgta gccccggctc ccgccagggc ctccccaaca 25260 
gactccccat ttggagagcg cattaagtgt ttccaaagcc tcacaaacca cagatgtccg 25320 
gctgtctcac ggcttctgta acctgaactt ggccctcact ctgccctccc agcactcctc 25380 
tcagggccca ggcccctcct ctgagatgcc agcactgact ccccaacttg tccccatcac 25440 
ctggctcgtt cctgaacctc ggcaggagag tctcaggcca gatcctccca ccagccacct 25500 
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ccaccaggat gcaggaggca tgagacctgc tcgtgccggc tgggagatgc aaccaaccaa 25560 
gatcaatcca atcagcggat gaaetgacaa atataatgtg gtccctccac acaatggaat 25620 
attattcagc cacaaaaagg gctgaaatag gccgggcgtg atggctcaca cctgtaatcc 25680 
cagcactttg ggaggccgag gccggcagct cacttgaggt caggagttca agaccagcct 25740 
ggccaacatg gtgaaatccc gtctctacta aaaatacaaa aattagctgg gcgtggtggc 25800 
gggcacctgt aatgcaagct acttgggagc ctgaggcagg agaatcactt aaacccagga 25860 
ggcagaagtt gcagtgagcc aagatcgcac caccgcactc caacctgggc aacagagcaa 25920 
gactccattt caaaaaaaaa ataaaaggct gaaacaccca tacgtggtac tacttggatg 25980 
actcctgaaa acgttacagt aaccaaggaa gtcagccacg aagacgcatt gtaagattcc 26040 
cttcatgcaa aatgcccaga acaggcagaa ccacagaggc agaaagtcga ctggtgttca 26100 
ccaggggatc cggggagagg gaacgggaag tcaccgtgta atgggtatgg gttttatttt 26160 
ggggtgatgg aaatctctta taacttgata gaagagaggg ttgtaaacac tgtgaatgta 26220 
ccaaatgcct gccttctata ctttaatatt ttatattata taagtttcac ctcaatttaa 26280 
aaaaaaaaca actcgacacc tttcacctag gaaagatctg gctttagctt gcatttcctg 26340 
taactcctgc ctaaagcctt ccagaagctt ccgctgcctt gtggatcaca accagactcc 26400 
acaccatgat ctggcctcta agggcctctc gcaggacacc ccgagggtga aggagcaccc 26460 
gtgggcccac ctctgcatag ctgcaaagct tctttccctg tcctcccctc tacatgggaa 26520 
gctctgcccg caggggcggg gccttatctg ccattctatc gcactcaacc ctagcacttc 26580 
actcggtagc agacaccaaa gcaaaacagc aacagcatta taccgggcca ggtgcacgtt 26640 
aactcactga attcatggta ggaaggattc tattcccatt ttacaggtga gaaaactgag 26700 
gcacacaaag gtagcatcag cttcctaagc ctcccagcac aggaagcggc caggctggaa 26760 
tcagaccctg ggcgcagggg ctctgtccac agtgctaact aactactcct gcccccgagg 26820 
gctgcagcgg tgagtgagtg agtttgtcag tggactggat gtccaaggtc atacaggaaa 26880 
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aatccagact attgtaataa cagcctctag accggctggg gccagaaaga tcgaggacgc 26940 
tgacacacaa ctgcgctcac tgcagctctg ccagggatgg ggctaaaggt ctcacacagg 27000 
gcagttaggg ctccccatag cctgggagag gaacggggtg agataacaga aactaggtat 27060 
ggtgcccgaa gtcaaacagc cactgagcat gtaaacccag gtgggtctga ccccaaaccc 27120 
ctccaccccc atcagccctg caacccgtcg ctgcaaggga gaaagcaact cagaggcctc 271 80 
acctgcctac atcccccacc cgtgtgtgtg agttctacta aatgcctgag cagtgacaca 27240 
gcacggctga aattaaacgg gttccaaaaa cgacaggaag cacgaagtga atctccccag 27300 
gaaagtgctg aacaaatgct ggatcgggtt caccggcgaa tttcttggaa ctgaagaggg 27360 
gagctaaaca cacggggccc tgctttggag gggactctct cagggtgctc cacacagcac 27420 
ttggttaacc ccactcagcc cttctgggct ctcccagagg gcccggcctt ggccttgggc 27480 
atctacagga ggaacctcca gggggagagg gggtgcctgg acaggccggc cctggaacaa 27540 
gcacttgggc cccgaggaga gaggactagg gcttgggagc tggggaagtt ctcagcactg 27600 
ggaccactag aacaaagcca tttccgtgcg ttcacagctt ccaattgcaa caggaagcaa 27660 
tcaggaaaaa taattagcgg cccacttact ggcttcgctg aggtccgagg catgtatttc 27720 
acacagtaaa accagggata taacatcaaa accgttctgc agaaagattc ctccctttcc 27780 
ttccatttta ggcctggatc accacattca ctggggctcc caggccttgc tgcctaatgt 27840 
taaaataatc aactctattt ttgcctcaca cacaactgaa ctctacagct ataattcttt 27900 
ctcctcaggg gctcgaacca catggacgac aggcatttga ctccagcaac atcaccccaa 27960 
aacgtgcaca aaacccaaaa ctgcaatgag gtgaaaggca acgcggtcgg cctagaaacc 28020 
ccccctttaa aacaaacagt ttccccaaaa ccccttttgc ctccttgacc caggcatttc 28080 
cggaaaaagg agcggcgctg gcctgtactc cccagatact gtcgctgttt tgtcttcacc 28140 
ttgttttgct agctccagac aaggccccac aatgtaaaca cgctcctgaa agaggcagat 28200 
ttggggtgaa actgtccata gaatctctag gcttgggtca gaggcaggag gacgtgaaac 28260 
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aaactccaag ctcctcctgt tccccgctgt cccccacacc tccaagcaga ggctgcagcc 28320 
tgggggatct gactacaggg ccaccccgct gcaccattca cactggaaat attcagggag 28380 
acagctgttt gccttaagga ggcccagaca aaggggcccg aggtcctccc cgctaaactg 28440 
ccacaaacag aacaggagcc gcggcgtgca caggcacttg cggccgtgcc acttggccag 28500 
ccatactcca gaaaaacaaa acacgcacat ccgaagagaa tgatttaggt agcaagaggc . 28560 
ttgcttgaaa aaccacatgg caatctccaa attaaaagaa catgtgtagc gtttcacgac 28620 
tgcttaagtt tcctgagtcc tcctgacctc aactccaccc cctgggaaac accaaaagtt 28680 
ggatgagaaa gttcccccgc cctacctctc cccacgggag tgtacaactg aggcacaagc 28740 
ctgcctcccc cactgccccg cgatctggga ccacgtctcc tccgcgtagc cgacccgggg 28800 
atggacacta tctggggacc cggcggccac acggggcatt cgggtcgccc gggcacctgg 28860 
caggtgtcag tccgcttgga aacccacagc cacgcggctc acaggagcag cgccaccggc 28920 
taggccgccc cgcgcccggg ctcagaactt tctcgctgcc acttcagccc gtcctcggag 28980 
cacgcggggc ggccgcgcgg ccgctggaaa caggcttgcg aaccggctcc ccgggccagg 29040 
cccgcctccg cgccccaagt ccccgctcgg tgcccggccc gggccacacg ggcccagcgc 29100 
gggctcggct cggctcccgg cttcccgcgg gctcgggcag gtgaggaccc gcccgcgccg 29160 
cacctggcgg agcgggcgcc ctcctcgcca gcccgggacg cagcgtcccc ggggagggcc 29220 
cgggtgggga gacaaagggc ccgcgcgtgg cggggacgcc ggggacggca gggggatccc 29280 
gggcgcgcgc cccaactcgc tcccaactcg ccaagtcgct tccgagacgg cggcggcgcc 29340 
cgcgcacttg gccgcggggc cgcccgggcc attgtccgag caacccgcgg cccgtcttac 29400 
acgccgggcg cgggaaggta tcgaatcagg 29430 



<210> 8 
<211> 33769 
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■1, 

<212> DNA 
<213> Homo sapiens 

<220> 

<221> unsure 

<222> (33739),(33749),(33758) 

<223 > Identity of nucleotide sequences at the above locations are unknown. 



<400> 8 

cttcccctta cactggtcct tcgacccgcc tcggatgaaa actgaatggg tttagcctta 60 
gaggctctcg gtctctaagg gaggtgggtc aggatgccgg ggacagggtc ctcttcctgg 120 
ggcaacgtgg gggaacgagc cacctacccc tccactgaat tgccctgggg tgtgggtacc 180 
gacggctcat tcggtgtcca gggtctgaga tgtgttgaca ggaagaatga aaggggatgg 240 
gagggatggg gcgaaagaag ccacctgcag ccccaggaac tatctggcca gcacaccgtc 300 
acccagcggc ctgagccacc cctgccagag ccaggaggag accctgccaa tgggtcacca 360 
gtgtgcagga actcagaagg tcatcacagt taataccctc catgccccaa tgtgggaaaa 420 
caggtttttt cacaacaaac aagataattt ttgttatttt ggcaaaagga ggcagggcag 480 
ccccggacac ctccatccca cctcatcacc cagccgcagg gccccggcca tccctgcaga 540 
cagagtggat gtcacaacct ccctgcaccg aaccaagtgc agctcccagg ccacaggcca 600 
cccaggaaag gtccagtggc ccccggaggc tcccaccgca ggcctcccac cacagccggc 660 
accaacccag gatagctgtg ttctcctggc ttcttttcac acgggtagca gaaagctgag 720 
atccggggaa agctgagatc cagggaaagc tgagaatcgg cctctgctgc ccggacgccc 
acccccagct ctgctcccag ctccagggcc tccttctcag gtgcccttac aggaggcaga 



780 
840 
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gggcttgagc cacctcctgg gcctggggca cgcaggatga acggggtcac ggtgcaggcc 900 
actgtccact gcgcagatcc caaggccata aacagcctgg ccacagtggc ttcccagctg 960 
gcaggcggcc agattatttt tgttgtttag caattgatta agtttctccg ctgcccccag 1020 
gggtaagtgg tggggcaaat gccgcaaccg cagcatttga cccgggatcc tgtgccaagt 1080 
gaccataggg tcacaaagca caagggaagt ggctgggccc gatgctggct ctgctggaac 1 140 
ctgaggccgg ccactgtcac ctgcacggtg cctgggacct tccagcaagc acagagaagc 1200 
tatggccctc caggagcagc tggcaggcac cttggcctgc agtcaggggc tctgtctgct 1260 
cagctctaaa acaggaaagt cgctgctctg cctggggtca gggcagccag agagtgacca 1320 
agtcagtgcc ggcctcagga agggacctgc aggcgggtcc cttcctctcc catccctcgg 1380 
tgccagccag cccctcctgt ggccccccac tgcctgcctc tgcccccatg ccccaccaca 1440 
acctcaggcc catggctgca tggccactcc ccaggcaggc agtggggatg ggatttcacc 1500 
atgttggcca ggctggtctc gaactcctga cctcaggtga ggagttccta aagtgctggg 1560 
attacaggcg tgagccaccg cgccagccct ccctgtggta ctaaacactc acaccccctt 1620 
gctggggacc ctggtgaggg aacacagcct cacaagtgaa gtgtggtttt gttgagcaaa 1680 
tgacgcctgg gcagccctct catctttgcc taaaactgaa gaatttaggg gcgtggatgt 1740 
ataaaacagt tggtgactta aatgaaaaag aaggccacac tccccccttt aggcaggcgg 1800 
cctaattctt taaaagccag cacagggtgc ctttctgaac ccaggcacac agtaggtgtt 1 860 
caatggacag cagcggttac ttgtactgct catgacaccc tgtctgtggc ctctgcagct 1920 
ggctccagcc tgacgcatgg ctgcgcccct ccgcaaggcc accccggtat acatggaaac 1980 
tctgtggaga aggccttggg ggccggccag gacgccaggc ccagatccca tctgcgccct 2040 
tcctccatag acctcagcga gctctcggca ccatgtgcct caggcccatt taagaagtag 2100 
ggccggccag gcatggtggc tcatgcctgt aatcccagca ctttgggagg cccaaggtgg 2160 
gtggatcacg agatggtcag gagatcgaga ccatcctggc taacacggtg aaaccccatc 2220 
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tctactaaaa atacaaaaaa taagccgagt gtggtggcgg gtgcctatag tccaagctac 2280 
tcgggaggct gaggcaggat aatcgcttga gctcagcagg cagaggttgc agttagcgga 2340 
gatcgcgcca ttgcactcca gcctaggtga cagagagaga ctctgtctca attaaaaaaa 2400 
aaaaaaataa aaaaaagaag cagggccagc cacggacgac ccctcacaca gctcccagga 2460 
cgcgtgcctg ggtatagggc tcaggaccat gaccgctgca gtggccccca agaaacgtta 2520 
cttttgtcac ccaccccgcc tcagtggcag tagccaaaat aacggattag aatggaacca 2580 
tgtgacaatg ccactgcccc aactgacaga agatggctat cagcagttca cgcggcccca 2640 
cctatcacaa gtgcagggca ctctacaact tatgcatcct tccccagaca ccgtcctttc 2700 
gaccctccca ggtcagcaag gcacacaggg cctacatttc acagccacac agcagagggc 2760 
tgaggctgga actcggatgc tctgatttcc gttcaatcac atccccagag gtggcacaga 2820 
gacggggggc ttctcttgac aaagtcaaga aagtcactgc cagctccact gaagaccaaa 2880 

gaacctcagc tctcaaaccc tcttgaaggt gttaccgaac tctcccagcc tgtttcctgg 2940 
gtcccgatgt tggtcccgtg ggacacagga agaggaagaa gctccctaga gcagagcctg 3000 
gtgcacctgc cacactctca gagggctgcg cacgggcgga ggagccgtgt gcaggagtgg 3060 
ggtctggatg gaggggcgct gtggccgggg gcagggggca ggggaagggt gctccaggtg 3 120 
gtgggcacag cacgagcagg ggcagggagg tccacactca gatgtgcaca gggagaaaca 3180 
aatcgtgcat ttccattgga ataggcggta aaaggtagaa aaacagagtg ggggccagga 3240 
agggagtcgg agccttctag tgtctctctg caggtgagcg gcagcccgag gtgtcagctc 3300 
agcagacttg gggtccaggg gccgtgtctt ctatcactga ccccagggca cacggaactg 3360 
gggagggaga gcagaggcac agggcacggt cagtgaaacg aaacaaggag tcatcaccaa 3420 
atgcggaaag ggcaaggagt gcccgcagcc gcacaagggt tctgtctggg caacgtgggc 3480 
gtcccaccag gccccgcacc ctgcaagcgc aaagctcgcc actgaagata aagggaagct 3540 
gttggagctg cggagctggt ctggggtccg catggagctg ggcttatgct gcagtcacaa 3600 
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gggggacatg gaagaggctg caggggacaa aaccagtgac cacagtctaa ctctgagcct 3660 
gtggaaaggc gcccacagca ttcacccatc ccagagatgc cattccccct gtgcccccgc 3720 
tccacggtga cagcgttctc caggaatatg atgcgcccct ctcctcttgc atcagccctg 3780 
acagtgagta ttcaggccaa aaagcagaag agcacagctg cgtggttcca tttccatgta 3840 
gttctggaac aggcaacgct aatccaaggt gatagaagtc aggagagtgg tggagggggc 3900 
gggggttgag gatggcaaag gggcaccggg aactttccca gtggtagaaa tgttctctgt 3960 
ctggaccgtg tggtagttat gcagacatat gcagctgtca aagttaatcc aaatgtacac 4020 
gttaaaatgt gtgcgtttta ttgcctgcaa gttatacctc aattaaaaaa ataaagttag 4080 
cactcaggct tcttccacaa cttcctgaac cgtgtgagct gattttcttg ctattaaaaa 4140 
ttcacggtcc atggctgaga acagcagctg ccttctgttt geaaagtcaa cgccaatcac 4200 
tgcccggccg cggcagactc ggccccacag gacctccttt cttttttccc tttgacctac 4260 
ttccctgata agtgacaaga cagccagact ctgggaacaa acgcccgtta ttcggccccg 4320 
agctgagcgg gccctgcttc ctgagctaat ccgcccggac agacggaggg acgtgagggg 4380 
ctttgccgtc ggctccagct gtcagtctgc ccgtcagact cgacagtggc cccctctgtt 4440 
cctcccgctg cccccactcc atccccgact tctttttgtt tcctgtccct gacagacgaa 4500 
catctgttaa aactctgtct gggtgagctg tggccagcgg cccacaaatc cccaagccgc 4560 
accccagcct catctgggcg ctgccgggag cactgcctgg ccaccctctg gacatagctc 4620 
tgagagccac cggccagggc acgtgtggcc cgagtggcat ggtgcacgcc gctaagccca 4680 
ctgcccaaag gcccccaagc aggagggatg tgcaggagac aaaagtcaaa agaacagggg 4740 
cacgttccac agaggatggg gctggagggg tggcagtgag gaacagcagc ttccgaggat 4800 
ggcggtggca actcccaaat aaggcctcac tcctgctgtt tttagctcat tccacataat 4860 
tggaaaaaca tggcagaaac cgaagccagc tgctgccttg gtcctggggc tgtgtggagg 4920 
gggtggggag gccggaggcc caggctctgc actcgactgc tggggatgag agtgactctg 4980 



5160 
5220 
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agctgcagag agcagcatcg cagccgccat ggtcccattg agccccggcc acgctgggcg 5040 
gcagaggctc gtgggatata cctgccctgt ctcatggggg tcacttcagg aggggcgggg 5 100 
gagccaggac acagcccagg gctagcggtc accctgcagc tcaggggcca cgtaaatagt 
gccaccttga aggcacacag cagtgcgggg ccccccccgc caccaacgca tccctacctc 
taggaggccg cctgtgtgcc cctgggaacg ctgctccctg tcccttgggg tcctggtgtg • 5280 
accaccctct cagccccttc cttggggaag gcacctgact ccctacaccc agctggcttt 5340 
catttgctca aaatcaggaa aaagcagaat tcaagacatc acagaaatgt cttcgcctgt 5400 
aactccatga aagataaacg gtcagacacc caggagggag tcccagggac ccttgagtct 5460 
cacctgaggc tctggcttca aacctcgaga tgtttccagc catgctagcg ccgcccccca 5520 
caacctgccc cacacagtcc tcccttggga actcacagat ttggccccca cctgccccgt 5580 
ttcttctggt ggagtgggtg cgttgggttg gggtggggct ggggactctg gatgtgtctt 5640 
aagagtctga gtgattctga cacagccagg ccctgccccc ctcctgacct tcgccccaca 5700 
ggaaagggag ccacacgcct gaagcgccca gcacaccccc ctccgtcctc cccaggtcac 5760 
ccgctggccg tgtgagccgt gctccccact gccccttcac ccaccccagc tcctcctggc 5820 
agcacccagc cttggaagct acttctgatt acaaccgccg aaggaagact cgctccctcg 5880 
gcactgaccc agacagcctg caccatcacg ctgctcagca caacccacac agccttcctc 5940 
caaaccccat ggagcgggga gtataatcac cccctttcta ccaacggaca aactgaagca 6000 
cagagaggtt aagtcacttt cctaagctcc caacacgatg acaaaaaata gaaggtcagc 6060 
ccgcaagtgg aactaggtgc tccaagtccc cggtctgcct gacactgcac ctcctcgccg 6120 
ccacggtccc gggtccgcct gacactgcac ctcctcgccg ccacggtccc gggtccgcct 
gacactgcac ctcctcgccg ccacggtccc gggtccgcct gacactgcac ctcctcgccg • 
ccacggtccc gggtccgcct gacactgcac ctcctcgccg ccacggtccc gggtccgcct 
gacactgcac ctcctcgccg ccacggtccc gggtccgcct gacactgcac ctcctcgccg 



6180 
6240 
6300 
6360 
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ccacggtccc gggtctgcct gacactgcac ctcctcaaca ccaccacggt cccgggtctg 6420 
cctgacactg cacctcctca ccaccaccac agtcccgggt ctgcctgaca ctgcatttcc 6480 
tcatcaccac agtcccgggt ctgcctgaca ctgcatttcc tcatcaccac ggtcccgggt 6540 
ctgcctgaca ctgcacctcc tcaccgccac ggtcccgggt ctgcctgaca ctgcactttc 6600 
tcaacaccac tccttggccg gctcccaact acaaaccaag ccatgtcttc catcctgaat 6660 
cctcttggcc taaacatcac tcacaatgcc tccctcggga acaggcacaa gtcccaccag 6720 
cacagcctcc ttcgttacct gcgtttccgc tagcccaggg ccagctccag agccctcacc 6780 
acagagcctc tatccttcac ccccggacac tggacctcac caacccatag cctggaggag 6840 
atccctgtgt gaccccaggg cctcctctgc ccgactctga atttcactgc ccaacgtgac 6900 
acctcggaag gctctctggg cactggcagc cctccatggg caccgctcct tctggccagc 6960 
tctgacatcc cggctggtga ggtgccctgc acgaggcctc tgcccactgg gacctcacag 7020 
ccgtgctgtc agctgcaaca agcgacagaa tttcacgttt tcttcacgtt gcccctgggt 7080 
gagcagctcc aggtagtttt cagtcgaggc gaggcgtccc gtcagcagcc aggcggcaca 7 140 
gctaattcat gcccgccggg cgcacggccg caataccaat gggcacctgc agcctggaaa 7200 
gccacagagg aaccgagaac agcgactgtg ctcaggtgac aggactgtgg tcttttaaca 7260 
aaacattttc ctttaacgtg atattttacg gcaaggaatg aaacctggag ggcaggacat 7320 
ttggatacta aagccccagg ctgccgcgtg gtctgctttg tgaagtctga agcccgcgcc 7380 
ccattctggc cccgctcaca ggtccggctc tgactcacca gcttcaatgc taggccgtgc 7440 
ctgtcctcca accagaacat gacttcctta aggacaaagc cgtttctcgc ccatccccat 7500 
ctccctctgg attaagaaat atgggaagat cttctagaac cacctcaaat ttgcagagag 7560 

ccatcctggtgacaaacccttgaaatgcttctaagaagagtttaggtttcttctcaactc 7620 
taaaacctct agaaaactct atttccacac cagctgcccc tggaacactt cagcttcaaa 7680 
agggcccagg gcagggagac ggaggagcca gcatccacac cgagcaccag cctgttaatt 7740 
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aacgggaagc gggtggggcc catctccagg cagctctgag gtcagactgg ggaaccatgc 7800 
ttacaaaaaa aagtgaactg aaacgctcac gtcctcatgc aaaaccagac tcccagttgc 7860 
atctttctgt ctcattgagg agctttttcc tccctttgac agaacaccct acacacggca 7920 
tctggaacca aagcagaaag attcaggctc agagtaaaac agtccccaca ctggctgcat 7980 
gtggacgttc ccggcccaga gtctcgccca agcagggcct ataaatgaca caaaatgttt 8040 
ttctcctgcg tgccagtcat gctccaactg agttatgtgt aaaagtgcct ctcacggctg 8 100 
agggcaaaaa cagttcccac aagactagag aaaggtgacc cctgacggct gagtctctag 8160 
ggagcgtgga gctgcgtgct cagccctgcg gccctgacgg ctctggaatg gaaaagctat 8220 
ccaactggaa gggcagggct cgctgctagt ccagcggtcc aaccccacag gtgtctgtgg 8280 
tgtcagctcc atgccacaga gcccagggct ggggccagag ccaccaggcc ccctgccagc 8340 
ctgcaggggc ctcctcctct gggtagccta accaccccct gtgagcgcag gcagcctcct 8400 
ctaatcacca cagggcctgt ccccccctct cccccgcttg caggaaaatg agccctgagg 8460 
actccccagg gctgctctgg gcctggacat ggagactggg aattacattt gcagaaggag 8520 
cgcaatgccc ttgaagggct cagccacgag cagccagtcc ccagggctca gaaggcccag 8580 
ctgttagaac cctgggagcc agcaaagagc caggggctcc acctaagtct atagcccctg 8640 
cctcttctgg ttgggaaaga aatcaacgcc cctttactgg ctcccactga cagcccactc 8700 
ccccaggtat gggaggattc tgggacgatg caggcaaacc tggaccctga gtgaacctgc 8760 
cccagctctc acgggcctgg caccagccac agcacctaag gcgccggtca tggtgacaac 8820 
atgaaggtga taagggcatg gacagtggac atggcagctg gacactgggc acccactgga 8880 
tgccaggcac ccagcacggc tccgtcaccc ctggatgagc agtggccctt tgcaagccag 8940 
ggtagcctgg gcaagttatt tgggggtctc caagcttgtc cagctgtgcg acttcactga 9000 
gccatgagtc tgggatttta tcagggccca cacccgttcc tggaactctg atacgtgagg 9060 
gagccacaca gggaccctta acaaaagctc ccagggcaac atgttctctt gcctcagtct 9120 
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cccaaatagc tgggattaca ggcgcacgac taccgcccgg ctaatttttg tatttttagt 9180 
agagacaggg tttcaccatg ttggccaggc tggtcttgaa cccctgacct caaatgatcc 9240 
ttccactgtt agggcaaggc acctgacagg cacgactgca cgatctgctt gttgggggct 9300 
gtgtccattc cccactcctt cgacaaatgt ccacacccag ccttgctttg acaccccaag 9360 
aacagagatg gtgacacctg cttcctacat gcccattgct ctcccaaggc agacatcccc 9420 
agcagatgca acacagtgtt taggcagaca tcaccaatcg atggtggcaa cagacaccag 9480 
gccctgctcc ctctaactcc agtggccagg ccccaagcca gctctcacct gcccactccc 9540 
aacccacagc agcaagactc agaaatggca aaaacacaaa gagaacagaa acgccccata 9600 
gcgggaggat gactaaaaga catgtcttga taagatattg ttcaggcata ggccaggcac 9660 
agtggctcat gcctgtgatc ctagaacttt aggaggctga ggtaggtgga tcacctgagg 9720 
ttaggagttc aagaccagcc tagccaacat ggtgaaaccc catctctact aaacatacaa 9780 
aaattagcca gacatagtag cgggcgcctg taatcccagc tgcttgggag gctgaggcag 9840 
gagaattgct tgaacctggg aggtggaagc tgctgtgagc cactgtactc caacctggac 9900 
aacagagcaa gactctgtct caaaaaaaaa aaaaaaaaaa gatatccttc actaaaactc 9960 
atgtctttga tacatattta cctcctgcaa tcgcaaatgc ttctgcagtg cataaagtga 10020 
aataaatagc aggaagcctt acggttcgat cacccacaca gacacacagt cacatacagg 10080 
aaaaacgcag ggagggctgg ggaacaaaaa aacagaagat aaaatgtgga gacagacaca 10140 
ccaagagagt aagagaccac ctccagacct cccttcagct tctcaaacac acgagccggg 10200 
cccgttacag aatttgcggg gaccgctgca aaatggaagt gcagacagcc ccttactcaa 10260 
aaggtaggaa tttcaggtca acaacagagc tcacctcata tgactacaca ggtcacacag 10320 
cccgtgaagt cggtcccaac accagcatgc tcctgcctca aagccgctgc acgtgctgtt 10380 
ccttctcgcc tttccctctt ttagtccttc agatctcagg cctcctgaga gagacctctg 10440 
acctgccggc tcaggcggcc acacccccag tacaggagtc tccggctcag cccctgctgt 10500 
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gttccgtacc cgatccaggt ctgtcctatg tccatctgtg tgccggcttg cttcctgaca 10560 
tggcccccac cacacgtgtg cctcggggca ggggaacagg cccgtctcat taactgcttt 10620 
cttctcagat attttctgga atatttgtgg atattgggca acatatatgc tccacctttt 10680 
tcagactagc caggacgagc tgcatttttt tttttttttt tttgagacag ggtctcactc 10740 
tgttgcccag gctggagtat agcggcatga tcttggctca gtgcaacctc cgcctcctag 10800 
gctcaagcaa ttctcctgcc tcagtctccc aagtagctgg gattacaggc ccgtgccact 10860 
actgcccagc taatttttat atttttagta gagatggagt ttcaccatgt tggccaggct 10920 
ggtcttgaac tcetgacctc aaatgatcca cctgccttgg actcccaaat tgttgggatt 10980 
acaggcgtga gccactgcgc ccggcccgag ctgcctgttt tacacctttg ccatattccg 11040 
gtgattctct ctcccctccg tcccccggcc ctgactgtgg tggccactcc ctgccgtcat 1 1 100 
gagcccgtat gtcctcactc tttccctttc cgccaggact tcaaccaaca ctgcagagcg 1 1 160 
cagggtccag ctccagcact gagttcagcc tcttctcacc aacagacagg caggaaagaa 1 1220 
aacaaactct gagaaggcca aggttcccgg gcagccagca agccaagcat ccttctccgc 1 1280 
tgaggcttgt gcagccgagg caccccctcc tccagggagc aggcagcgtc ctggggcagt 1 1340 
ctgcgaggga gaccagggcc cttgctccac cagggcccca ggtatggggg cagcagcaaa 11400 
ctcatggctc tgggagccag accccacctg ctagaaccta ctatgccacc tgctgtgggc 11460 
aaccccaggc tggtgacttg ccctggcctc ctctgtaaac aaagggctca tccaacctgg 11520 
tcaaaccact cctccccttc aagggtctat aatcctccct taacctgctt ggtccaaacc 11580 
cctggtgtcg ccaggtcact caggaggcag ctcatctgga ctccttccct gggtccagtt 11640 
tctctctcaa cattgccttt gaggccgagg tgaacggtca acagcgaagg gccccagagg 1 1700 
tgatggagga gcgggtgtcc aagacactca ccctttctaa tgcactgact ccctcgtgga 11760 
ctcacttgtg ccgtctcccc cacccaccca gccccagagc ccagagtgcg agcgccagag 1 1820 
gcccgggatt ctgtctgcac cgcggggtcc ccagtgcctc ggagcaatgc cagcacccgg 11880 
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caagtgttcg acaaatgcct gctgaatgag caaatggatg gatgaacgaa tgaatgagca 11940 
agcagatgaa tgaatggggt gctgtccaga gccgtgagga ctaggccgcc caagtcccca 12000 
tttctcaaat tctccttctc ccgacttggg aaacaagatg cttggtcggg gaggctctcc 12060 
aaccatcccc tgcagcagcc ggcacagcgg acagaccctt tgatgtaaca gccatgtctt 12120 
cattaaagat gccctgctct cagaaagaga aagacaaata caaacctgga aaatcctcac 12180 
caaacgcagg acccctgcca gggagcagag aaaagaccca cacgccacgg gcgccacgac 12240 
cacacacaca ccccagccgc tgcacacaaa cacagaccct agccagcaag aacaggggga 12300 
ccaggaaact gttcctaaag tcaggacccc catgtgctca gacagcagtg agagcaagga 12360 
cacttctcca tccaccggat gccaggagag tccttttagg gggccccaca ccgagactct 12420 
gcccttagga ctgttcctga gtgtggaagc cagcccactt ggaagccccc tgccctcccg 12480 
agtgggacac cggcacagga agcaggccct gtcccccacc actttctgca agctgggccc 12540 
catcacgcta cagaaacggg gaggactggt cccagggatg gcgctttcct gacacctctc 12600 
gttaccccct cgcttgccag gccccagggt cagccccaga ggccagactg gctatcccag 12660 
gcccgggagc atccccgaag gcgagctgca tcctgaacgt gtgtgatttc ccgaagggcc 12720 
cgccccgaac cgacacctgg aaagaaagat cctcagccgg tgccccagag gagaagagcc 12780 
atgcctcact gcaacacagt cccaggaagc accaagtgcc tgaggaccaa ggcggagagt 12840 
aaaaaagtgg aaaatatctg gggcaaaaat aaaacaaaac aaaacaggat tgacctcctg 12900 
ggctcaagca atcctcccaa ctcagcttcc cgagtagctg ggaccacaga cttgaatcac 12960 
cacacccgcc aagtggatca tttcgaacgg gtttgccgag gttccttctg gggcaccccc 13020 
ggcggccgca acccattccc gccaggcccc gccccgcccg cccgccccgt cccgtcccac 13080 
cgcctcacct gccttacacg tcctgccgtt gtcctgcagc tgcacacccg tggggcaggc 13140 
gcatgtgtag aaaggctcgc ttggggacag caggcacagg tgggagcagc cgccattgtc 13200 
ctcctcacag cgagtgtgga ctgagaaaac caggacagac tgagagaagg ttccagaaga 13260 
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ggaccgtcac ttgtttctga atgagtcaca tcctgcctcg tcccccgtga cagcctccag 13320 
tgtgtccctc tgcccaaaca tcggcctcaa gtggcatcag ggacctcccc gcgggcacca 13380 
ttccacctgc ctcatcgctg gccccgtcca catggggccc tcagcctggc cagacggcct 13440 
gcaatttccc caaaaccagc cgtgaccttc ctggccaccc tcacacccag atgtgacctg 13500 
cccatggagt gacatcctcc ccatctgctt cctcccacca agctcctatg actagaacac 13560 
cctccccagc tcctcggagc ccccaaagga cacccctctg caaaggctgc cccccacgct 13620 
ccaatggccg gggtcaggac ctgcctgtgt ggtagtgacg ggaaccccag agacaatggg 13680 
ctcctgggca aaaggcttgt cttgtctttg tgctatgtgt ggacccagca gcttccatag 13740 
gaacactgtc cttcttgctg ggatggccaa gcttgtcact ctcccaagcc ctcctatgac 13800 
caacagcaat tgaacggaac tcgataaatg cttccagcac ctcattcaaa ccaggggaaa 13S60 
gctgggtgta gcagccccaa aatacggata taactggaac aacaaactca tcaaaatgaa 13920 
cctctccctc cctcatgctg ccccaagtgt agatgggttt tgtgaccacg actttctcac 13980 
caggaaacag ctccagagag ccccaccctc ctgtgtcctg ctctgggaac agctggcacc 14040 
cctaggcccc acatttcaat tcaaagtcca aaccttccat aatggcctgg ccagaaatct 14100 
ccatccctgg tccctgtggg agtgggccac tgtccccaga gccgcagccc cactgtcaca 14160 
gaagctggtg catttcccca tcagggacct ctgtcacaac ccagcgtggc ccccaggctg 14220 
agaactgctg attctgggca gattattcat tgataaatac gcgacttgca gggccaagca 14280 
tggtggctca tacctgtgac cccagcactt tgggaagtca aggtgtgagg atcactggag 14340 
cccacgagtt tgagacaagc ctgggcaacg tggcaaaatc tctcatctct attaaaaata 14400 
catacacaca cacacacaca cacacacaca cacatatata tgtatatata aataaccata 14460 
tatatatata cacacatacg tgtatgtgta tataaataca tatacacaca cacacagaca 14520 
acttcttctg ggccttgaaa acgaggcaac cttccttgga aatccccttg ccactgctga 14580 
gectgaaata gcccccatga gctctgcaga ggggtcctct gcaggcccgt gtcccccagc 14640 
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cagccacaca cctccctcca ttgcagcagg taccccttta gagagggggc cccccagagc 14700 
atgggcttct gcagggaggg gtcacctgcc cccccacccc acccacgccc gcgcaccccc 14760 
acgcccccgc atcctcccac tcccctgccc cgcgcccccg ctccccccag ccccctcacc 14820 
ctctcccccg tgccccaacc ggcactcaca aaaaggctgc cgctcctggc tcagcacctg- 14880 
gatgtccatg ggtgagtata gggcactcag gatctccttc ctcttccccc cagtgcgctt 14940 
gttgcaggca tggatggagc gggtctgcca gtctgtccag tacagagtgt ccccggagag 15000 
cgtcagggcg aaggggtgcg tcaggctgcc ctccaccacc ttctgcctgc agtcagggaa 15060 
gcggggtgga ggagccatca ggagggtccc ccgacagtca ttgctgctga cccaattaat 15120 
ttcttttttt ttttttgaga tggagtctcg gtctgtcgcc caggctggag tgcagtgatg 15180 
taatctcagc tcactgcaac ctccgcctcc cgggttcaag caattatcct gcctcagcct 15240 
cccgagtagc tgggatcact gatgcccacc actacgccca gatgattttt gtatttttag 15300 
tagagacagg gtttcatcat gttggcaagg ctggtctcga actcctgacc tcaggtgatc 15360 
cacccacctc agcctctcaa agcgctggga ttacaggcgt gcgccaccat gccaggcttc 15420 
ccatttgctt tcaaccagac aagtgaggcc aggtcaagag ccccaggagc tggcgccctc 15480 
gtacatttct cccggcgtgc acagggcacc tcccaaacac agcctgtgat ggtgacacac 1 5540 
gggctccccc aggtcaagtg gcaaagtctc ccccagggaa gaaaggagga agccatgcct 15600 
ggcaaaaagc acacctctcc tgcccaacgc tttaacctct gtatacaaat caggccatgt 15660 
gcactcgctc cttcttacaa tgctcataat ttatactttc agagtaaatg aaacttggca 15720 
tcaacccgag aaacagctat tcttttctag atgcttacag tgcccagcaa atgaggactc 15780 
gggtgtaatg agattatgga cactggaaac aggatcataa tgtgacgtgg tcggtaatgt 15840 
gcagttttat ttgcttaatg accctcgccc cgtgacaggc tccctgaggg tgggcctggg 15900 
ggcagaggtc cccgccacgt ccccagccct cagcacagtt gccaggagag ggtgacactc 15960 
atgaagtggc acagggaaga tgggagctgt gggctctgca gatccaccac ctcttctgtt 16020 
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catttttgtt gatgctgttt tttaagaaaa ttattgaagt aaaattcaca ggacatacgt 16080 
ttactttttt tttttttttt ggagatgggg tctcactctg tcacccaggt tggagtgcag 16140 
tggtgtgatc tcagctcact gcaacctctg cctcccaggt tcaagcgatt ctcccacctc 16200 
cgcctccaga gtagctggga ccacaggcgt gcaccaccac acccagctaa tttttggggg 16260 
gtatcttttt ggtagagaca gggtttcgcc atgttgccca aggctggtct tgaagccctg 16320 
agctcaggcg atccacccgc cttggcctct caaagtgctg ggattacagg cataagccac 16380 
tgcacccagc ctaaatttac cactttaaag tgaatagtgt tacctagtgc attcgcaagg 16440 
cggtgcagcc tccacttctg. tctagttcca aagcacttcc attgccccac aggcaaaccc 16500 
cacacccggc agcagtcatg ccccagtccc cgcccccagc cccggcaaac acttttgatg 16560 
gacttaacta cacacattct caacatctca tataaacgga atcacaatat acagcctctg 16620 
atgtctgtct tctttgactt ggcaccatgt tttcgaggtt catccaggct gtagcatgtc 16680 
agtgcttcat cccgttttag gggtgaacca tattccagtg tgcagacaga aaccaatctg 16740 
tgcatccatt cacccactgg gggacctttg tgtcatttcc accctcggct gttgtgcaca 16800 
gtgctgctac ggacattact gtccattcac attttgtgtg aagacctgtt ttcgattctt 16860 
aagagtatac agctaggagc ggaattgctg ggtcatacgt aaatcaatgt ttacgtctca 16920 
aggaatcaac aaactgtttt ccacaatgtt gtcttttttg tttgttttct gagacagggt 16980 
cttgctctgt cacccaggct ggagtgcggt ggtgtgatca tggctcactg cagcctcaat 17040 
ctcctaagct caatccatcc tcctgcctca gcctcctgag tagctgggaa cacaggtatg 17100 
taccaccatg gccagctaat tttctaattt tatttttttt tgtttttgtt tttttgagac 17160 
agagtctcgc tctgtcgccc aggctggagt gcagtggtgc catctcagct cactgcaagc 17220 
tctgcctccc gggttcacac cattctcctg cctcagcctc ccgagtggct gggactatag 17280 . 
tcaccggcca ccacgcctgg ctaatttttt tgtattttta gtagagatgg ggtttcaccg 17340 
tgttacccag gatggtctcg atctcctaac ttcatgatcc acctgcqttg gcctcccaaa 17400 
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gttctgggat tacaggcgtg agccaccacg cccgacctta cttttaattt tttaatttta 17460 
ttattttatt ttattttttt tttttttgag acagagtctc gctctgtagc ccaggctgga 17520 
gtgcagtggc gggatctcag ctcactgcaa gctccacctc ccaggttcac gccattctcc 17580 
tgcctcagcc tcccgagtag ctgggactac aggtgcccaccacgatgccc ggctaatttt 17640 
ttgtattttt agtagagaca gggtttcact gtgttagcca ggatgatctc aatctcctga 17700 
cctcgtgatc cgcccgtctc agcctcccaa agtgctggga ttacaggcgt gagccaccgc 17760 
gcccagcctt tttttttttt tttttttttt ttttgagata gagtcttgct ctgtcgccca 17820 
ggctggagtg cagtggcggg atctcagctc actgcaagct ccgcctccca ggttcacgcc 17880 
attctcctgc ctcagcctcc cgagtagctg ggactacagg cacccaccac cacacctggc 17940 
taatgttttg tatttttagt agagacgagg tttcaccgtg ttagccagga tggtctcgat 18000 
ctcctgacct cgtaatccgc ccgcctcggc ctcccaaagt gctgggatta cacgcgtaag 18060 
ccatggcgcc cagcccatgt ggccattttt cagtgagaga agccagaggc ccatcactct 18120 
cggttgctcc ctgggccatg ctctgcctca gccagaagca ctgagggaag gtcagcctcg 18180 
gcccttgccc cagccacagt cacagataaa ggggcctgca caggtctgtg tggctccaga 18240 
gctcgtcacc caacacacga cgcttccatg tgaatagccc caggtgcatc atgaagagcg 18300 
atggccgctg cagaggcaga agaatcccgc ggggaagcag gtgggagaga ggctgagaac 18360 
agaccagacc ctggagctac agaccctatg ttccaaccct ggctgggact agctgtgtgg 18420 
ctctgggcaa attcacatgc ttctctgtgc acaggggatc aaaatagcaa acacaggcta 18480 
ggcacagtgg ttcacaccta taatcccagt gctttgagag gccgaggtgg acacatggct 18540 
taagctcagg agtttgagac cagcctgggc aacatggtga aacctcgtct ctacaaaaaa 18600 
aataccaaat aaattagcca ggcgtggtgg tacgtgcctg tggtctcagc tacttggaag 18660 
gctgaggcgg gaggaacact tgagcccaag aagtcaaggc tgtggccgcg tgtggtggct 18720 
cacgcctgta atcccagcac tttgagaggc tcaggtgggt ggatcacttg tgatcaggag 18780 
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ttcaagacca gcctggccaa catggtgaaa ccccgtccct actaaaaaaa tacaacaatt 18840 
tgccaggcgt ggtggcgggc acctgtaatc ccagctactt gggaggctga ggcaggagaa 18900 
tagttagaac ttgggaggtg gaggttgtag ttagccaaga tggtgccgct gcactccagc 1 8960 
cagggggaca gagcaagact ccatcccaaa aaaaaaaaaa acaaacaaac aaacaaaaaa 19020 
agaggtcaag gctgcagtga accatgattg tgccaatgca ctccagcctg ggtgacaaag 19080 
tgagaccctg cctcaaaaca ataaaaatat aaataaaaat aaaacataat agcaaacgtt 19140 
tcatagaggt ggtatgagca ttaaatgaac tgataaacgt ccctggaaaa cagtaagtgc 19200 
tatggaagga ttcgctgccg ccaccgccac caccattagc atgtttcaac ctccatcacc 19260 
ctcactgtcc cctgtcacca tcctttgacc agggcactcc cagctgcagc ctttctatcc 19320 
tcttgtccac ccttcataac tgtaagatca ctcagctccc aagaaccaca gtctacaggg 19380 
taaccacatt tccaaatctc aaaccagacc cgctggtctg cacttccagg gacaacagga 19440 
tattttcaaa ccagcccaaa agagatgtgt ggctcagcat aagaggaaca ggagaaactg 19500 
aggcctcttg ccctgagaat gagcttggaa gtggatgtcc cggcctcact caaaccttca 19560 
gatgactgag gcccagccag gagcttgagt gtaccctcag gtcataccct gagccagaag 19620 
cacccagcta atccactcct catcactgac tccctcccca taaaaaacct gtttgctgtt 19680 
tcaggctgtt aagttgtggg ctgttttgtt acacagcaat ggataactaa cacacgaggc 19740 
ctggcaagtg tggagcaaag ctgcccaagc cctcaagtct gttcatgtgg gtgttggcct 19800 
gtgtttgcag aaatccagcc actgagtcct cccatgcagt cactactgcc ctctgcacag 19860 
acacctgcca catccctgcc tgggceagga gctccactag tgcaggaatg gggtctgccg 19920 
tcccaggagg atccctgaca cctagcacag ggctagcagc aggcagcact tggttagtga 19980 
ataaactgcc cttcacctgt acacagaagg gatgtttcta taaggggtaa ttaagtacag 20040 
agctgggaag ctatgctgac cagaaggctc taaaagcaat taaccaacga ggggaaaacc 20100 
cttcctactc attctcggcc cattttattg agcactgacc atgtggaagg ccccctggtg 20160 

BNSDOCID: <WO 0192B91 A2_l_> 



WO 01/92891 

PCT/US01/16946 

agactgggga atgcaccaat aactgagaca gcttccggct gttgccctca ggatgcctga 20220 
gctgggatag ggccagggtg ggggtggtgc gtgtgacagg gttactgttc acaaccctgc 20280 
cgggccataa gccctcccca acaattccaa aatccaaaac gctctgaaga tggaaagctt 20340 
ttgttgctca tctggtgaca aaacctcatt tggtgcatgg gccgggtgcg gtggctcacg 20400 
cctgtaatcc cagcactctg ggagccgagg ggaaggatcc cttgagctta ggagtttgag 20460 
accagcctga gcaacatgtg agaccccgtc tctaccaaaa atacaaaaat tagccaggtg 20520 
tggtggcgca ctcctgtagt cccagctact cgggaggctg aggcgggagg atcgcttgag 20580 
cctgggaggt gggggctgca gtgagctgag attatgacat tgcactccag cctgggtgaa 20640 
agagtgagac tctgtctcaa aaaaacaaag ttaaaaaaaa aaaaactgtg catgggtgtg 20700 
ggctacagat agtcttttct gccctactta gaatgaacgt gccacatttg ctatagaaat 20760 
attcaagggc tggtggcaaa tgccacacag accctgacgc tgttccaagt tctgagaagt 20820 
cctgcattcc tcagggcccc agagtttcag agaagagtct gtaggcctga gttaagaagg 20880 
aacgccttca aaagccctgg ggacaaaggg gaaaggggtg ccccaggact gcgtgggtac 20940 
ctaccggaac gagccgtcca ggttggcacg gtggatgaag ctgagcttgg cgtcagccca 21000 
gtagagcttc tgctcctcca ggtcgatggt cagtccattg ggccagtaaa tgtccgagtc 21060 
cacaatgatc ttccgggtgc tgccatccat ccctgcccgc tcaatccggg gcgtctcacc 21120 
ccagtctgtc cagtacatgt acctgtgacg ggggcagggc aagagaagca gctaacacag 21 180 
atctgttttt tgtttttgtc tgcatagatg cagacatgaa acaacagaca gtgaacttgc 21240 
cctaaaatct cacccatcgg aaataaccaa caggtatggt ttcaggtatt cctgccttaa 21300 
gctgggcaat caaaatatac tatttccaac ttgttctcag ttaacagtaa attctgggca 21360 
ccttcccttc ttgtggatag aaagattcct tgttcttttg atgattgcct agtgtactct 21420 
gctgtaagtt ttttaaagaa cttcaggtta tttctgattt ttttgctacc atgaaaatgc 21480 
tgtaaatgaa cctctaaaag gcaattcaaa acactcagga tggaatatta tttagtggta 21540 
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taaagaaatg agctatcggc tgggcccagt ggctcacacc tctaatccca gcactttggg ' 21600 
aggccaaggc gggtggatca cgaggtcggg agatcaagac catcctggct aacacagtga 21660 
aaccccgtcc ctactaaaaa tacaaaacat tagccaggcg tggtagtgag cacctgtagt 21720 
cccagctact taggaggctg aggcaggaga atcatttgaa cccgggaggg ggaggttgca 21780 
gtgagcagaa atcgcaccat tgcactccat cctgggcgac agagcgagac tccatctcaa 21840 
aaaaaaaaaa aagaaaagaa aagaaatgat ctatcaagcc atgaaaagac atggaggaaa 21900 
cttaaatgca tgttagtagg tgaaagagcc aatctgtatg agtccagttc taaacactct 21960 
ggaaaaagca aatacacaga gacagtaaag catcagtggt tgccaggagt tggagaggag 22020 
agggatgaat gagtggagca cagaaaatca gggcagtgga actatcctgt atgacatgga 22080 
atggtgggtg catgtcctta ctcatctgtc taaaccaaga atgtacaaat caagggcgaa 22140 
ccctcgtgta aacgtggatt ttgggtgatg gtgcgtcagc cagctttcat cagttgtaac 22200 
aaatgtacca ccctgcacag gatgctgaca gttgggaagg ctgtgtgggt gtgaggacag 22260 
ggatgtatag gaactcagta cctgctgctc atcaattttg ctgtgaacct acaactgttt 22320 
gaaaaaatta agtctattta aaaacaacaa aacatggcca ggcacgatgg cttgcacctg 22380 
taattccagt acttcgggag gctgaggtgg gtgggtcact tgagccaccc tgggcaacat 22440 
ggcaaaatcc cacctctaca aaaaataaaa attaaaaaaa agttagctgg gcatggtggc 22500 
acactcttgt agtcccagct acttgggagg ctgacgtggg aggatccctt cagccctggg 22560 
aggtcgaggc tgcagtgagc tgtgactgta ccactgcact ccagcctgga tgacagagtg 22620 
agaccctgcc taaaaaaaaa aaaaaaaagg ctgggtgcgg tggctcatgc ctgtaattcc 22680 
agcgctttgg gaggccgaga tgggcggatc acgaggtcag gagatcgaga ccatcctggc 22740 
taacacggtg aaaccccgtc tctactaaaa gtacaaaaaa aaaaattagc cgggcatggt 22800 
ggcggacacc tgtagtcaca gctactcggg aggctgaggc aggagaatgg cgtgaacccg 22860 
ggaggcggag cttgcagtga gccaagatca caccactgca ctctcagcct gggagacagc 22920 
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aacactccgt ctcaaaaaaa aaagaataaa acccatggct gggatggacc ctgaacctgc 22980 
agctgcagct gttcctgggt aggtctgtgg gcgacgtggc tttgcttctc catgttccca 23040 
agagacaagc atcacccatc catgagaaac aagcacatcc tcagggcgcc cttacgtgat 23 100 
ctctggccaa tgaaccaaga caaagtgagc agacaccagg tctgggatgg caggtcccac 23 160 
ccccaccagt gcccagtgtg ccctgtttgg aggtgaccac agggtgtgtg cccagaggct 23220 
gggcgtgact ctcagcggag accagagggg aaccacacca gcttggagga ctcagttccc 23280 
atcccagcca gctgggatga gccacaggac acaagggctg gcagacctat tgtgttttgt 23340 
ccacccttca cagcagagaa aggggacagt gcccagaatg tcctctgagg agcctcctcc 23400 
cactcttggt ccttgtaaaa tggtgctgac tcccttgctc ccttcttcct ggggtgggcg 23460 
gcaaacccca ttcccctcag ccttagcaag tgatttagaa acaggcagct cgcccaagcc 23520 
aggcatgaga gtgatcccgg gacacaggga gaacaagccc cgctttgccc tctgggggtc 23580 
tccattcagc agaagaggca aatgacagac acacagccgc ctcctccccc accatggtgc 23640 
tctgcagcct caggagcctc aggtgcacca agggccaccc catccagggg gccatgcttc 23700 
cttgagtggt atcgttcctg agcgagtacc atctccacct tccagagggg ctgtgacaag 23760 
atcaacaaga atgagggcat aggagcctcg aaccaaacat gccctcttcc ctgcagaggc 23820 
tgactgcgcc cagctgctat caccaagccc ctgctcctcc ggccccgtgg ggacagggta 23880 
agaggggtgt cacatggaac agctctccaa acagtccctc tcaagctgct gtctcctgtg 23940 
catctagtga gaacccaacc aacaaaggga aggtgggaat tgctattccc attaggcaga 24000 
tgagaaaact gaggccccga aaggctggcc tgttccaggt tacaggcgct gagcggctgc 24060 
tctgggaaca cacttggtgt ctgctgaggg cccgagcccg gccatcatat gactcaccct 24120 
tcgccagcaa agcccgggtg tgggtgaact tttcctggca gcctgggact ccaaggtgct 24180 
ggcagccagc ccagggaagg ctcccgcgtg cctgcggcag acgccttgct ttacctgcac 24240 
gtccccaccc ctaggagcct ggacagagcc cagaccctcc gccacctcct gagaaggtat 24300 
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caggggcatc agtctggact tgggggggaa tccacacagg ccttccccaa atgctccacc 24360 
gtggcccatg gaaaaggctg gaaaacgtgc aggagcagga gcctccgcat ggagcataat 24420 
tcacattcct tccccgagtt tcataacaga ggcctgctgg tttccttaaa tggggaattt 24480 
gcgagccagt cggtgaccag agactggttg gcgtggacgt gctcttgcag agtctcaaac 24540 
gctaccacaa gcccagccaa attccacgga ggaaaatcga cttccgaaga aaagagctgc 24600 
agcatggcct tcgtgcagag ccagctgcgg ttgtggttgt gtgttatttt agggaagggc 24660 
cattttgcat tttaaagagg gggttgggtt tcaccctggc tttaatttga gacccggggg 24720 
ccactgcagc cccttgtcag gctggtacag gccggggact cctcccatgc taagccagtg 24780 
tctttctggc cccagatcct caggggccag agggtcatcc ccagagcccg ctctgccacc 24840 
cacatgggta ccctgggcct gggagggatg tgccttccct caaccctgcc tggatgtccg 24900 
cacggggcca cctgcattgc tgaaactgea acgaagtcga gtctcaggag gggcccccct 24960 
ggctgcaggg ctcttgatcc ttttggccac gtgcacactg aggtggacgc tcggacccag 25020 
agaccccctt catgatgatg gccggggcag gaaccccctc ctctgaggaa ggaccctggt 25080 
gggggacagc actgcaggag ggcacaggag atgacggggg ctctagcagg gccgggagga 25140 
aggccaagat gctcctcgca accgtgtgcc tgtggccagg acagaggaca aacccaccct 25200 
ccactgtccc cactctcagg acagcagtcc tgccccagga ctcagcgccc acacttatgc 25260 
ctgaggacca ctattcaagt cagtatttgg cgagcagggg ttgctgccgc gggcgctgtg 25320 
acaggctgga atcctctccc tctccctctc cctctccgga gacatggagc ctacagggac 25380 
agagtcagca cctcagggta ggaccatggc tggcgtcatc agcatcactg gatctgatga 25440 
gtgggagccg gcatctcact gttttcactc tctcattcaa atgactggag caaagggaag 25500 
gtgtggggag aggcccagga atcaacacta aggtcaactt tgcccccagg ggcaggggtg 255.60 
ggagtgaaca gccacaggtg tgatcctggg gagggcttct gggagagaat tcagaggcaa 25620 
gcatgtagag gaaccatttc aaatagttaa gaaaagccag agccaaacag ggacagttgg 25680 
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ctcgcagaga tgatgcaggc aaagccagct cagatctgag catgggaaag actactccca 25740 
accaagggcc cagcatctcc caaccaagca ccaagtacct cccaaccaaa tgccaagcac 25800 
ctcccaatca aatacctccc aaccaagcac ctagcacctc tcaactggac accaactact 25860 
cccaaccagg caccaagtac ctcccaacca agtgccaagc acctcccaac caagtaccaa 25920 
ttacctccca accaagcgcc tagcacctcc caactgagca tcatgcacct cccaacagag 25980 
catctagcac ctcccaactg atcacctccc aacctagcac cgagcacctc ccaaccaagt 26040 
gcagagcacc tcccaaccaa gtgccaagca cctcccaatc aaatacctcc caaccaagca 26100 
cctagcacct ctcaactgga caccaacaac tcccaaccaa gcgccaagca cctcctaaca 26160 
aagtaccaat caccttccaa ccgagcacct agcacctccc aactgagcat catgcacctc 26220 
ccaacaaatc acctagcacc tcccgactga tcacctccca acctagcact gagcacttcc 26280 
caaccaacat agcaaaagcc ataaagaagt aaaaagacaa aaccacgtag gcatggagac 26340 
tggacttctg gtggcgagga aagggcattt ttattataac gacagctaac atttgttgaa 26400 
ctcacaaact gttcttggtg ttttcctcat gacatgcagc atggtcacgc ctctgtacag 26460 
acaaggatac tgaggcacag agtggcaccg tgccaacctt gtctcatctt tttatcgaac 26520 
ctacatgcag agtgccagca aatccagctg tcttttctct tcagaacaga tcccaaatct 26580 
cgccactcct tacccccaca agtgaggtgt ccccgctgct gctttctgtc gccaggatcc 26640 
cggtaataac cgtggagagg gctcctgccc ccacgccacc caccccacag ctcactctcg 26700 
ctccagccac caggggatgc cttccagcac gagtcagagc tggcacctcc tctgctcgag 26760 
acctcatgtg tcctctcctc acaccttggg ccctgtttcc ctacattctg ctacagcccc 26820 
tcaaacaggc cccgccccaa accagcccag ggcctttgca ctggctgatc cctctgcctg 26880 
gaccgcgctg cccccagaca gccacacggt tctcagcctc atctgcttcc agtctcgact 26940 
caaaagtcac caagaggcct tcccagcacc tgagctccga cggaagcccc tcgccacagc 27000 
acccaagcac tgctttatcc ccctacgcac acgtcccttt caaatactat tcatttacca 27060 
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tctcctccca ctcactgaaa gggccagaga ctgggctata cccgctgcgt ggggagcagg 27120 
accaggcgca agggctcaca aatgcagtgg atgcctggtt gggaggtgag ggagctgcag 27180 
cgacccacgc tgggagggaa cgcaatgaca ggaggagcgc aggtcctggc gacacgatgg 27240 
ccatggcagc cgctggtgag caaccgcagg ccggccctgg gagagggctt ctagcaagct 27300 
gctatcttca gcctctccga ctactgcaga tgccccctcc tagccagaga cactgctaca 27360 
ccagccgacc cttccaaaaa gaaggtcagt aaccccgcga ctcctggagc cacagtgcag 27420 
ggggagaggg ctgagagggc aacagttcac caagcggaac agaggctgcc ccggaggtca 27480 
gctggctccc cggcagctgc aggggtggct agcccactcg gagggcagcg agggcatacg 27540 
aggggctcca gggatgagtg gttgcccagc acagcacccc tgggaggccg ggggcacttc 27600 
tcaggtagtg ggggcacgag gctgctctgg cctgacctca gggactcaaa atactttggc 27660 
gataaattcc accgtgtccc acccctgctg gtaccccata cttacacaca gactggttca 27720 
gatgcagaca ctctcgcgca catactcgct cacacgggca catacacgtg cacacacagt 27780 
cacatgcgca cactcataca cacacaaata tccactcaca cgcatgcatg cacacacacg 27840 
gacacacaca ggctcacacg tatgcacgca tatgcgtgca cacgcacaca cacacacaca 27900 
cgctcacatc ctcccactcc cacactcagt tgctcagaca cacacacgcc tggctctcac 27960 
acaaacctgt tgggctctga aaggctccag cccttcccat gctcgtcaga agccagtcaa 28020 
tggcttccta agtcaccaca cagatcaaag aggtgaactt ggccacatgg cactctgctt 28080 
cctgagctcc caaacaccag ccttggtgag gacagaccct caccccacac cctcattccc 28140 
actaccctgg gcaggcccag aggaggggca tctgcaggat ctggcaacca gcccctcccg 28200 
cccggctcct gcagccggca ccatgggagt cagggggagg tcactgcaaa gggcaacagc 28260 
aagttggtgg ccccaggact agagcccagg ggtcttcagt cctactccag agcttggaca 28320 
ctgtcccaca gggcatggcc aagggaaggg cttccagagc cctgacttca gggaggaggg 28380 
caggcgggct cctgtggcag gcctggatgc atggccgccc actcctggga ctttctaacc 28440 
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tagaatatct aggtcaggct gggtgcagtg gctcacgcct gcaatcccaa cactttggga 28500 
ggccgaggag ggtggatcac ttgaggttag gagtttgaga ccagcctggc caacatggcg 28560 
aaaccctgtg tctactaaaa atacaaaacc tagccaggtg tggtagtgca cgcctgtaat 28620 
cacagctact caggaggctg aggcaggaga atcacttgaa ctcgggaggt ggaggttgca 28680 
gtgagctgag atcgtgccat tgcgcaaaga agatctaggc cggcccctca accggtgagg 28740 
tccaggctgg gagtgctgag agactgtggt gacactgaat gaactaacag gcaaagggct 28800 
tccaactgag cctgggggtg gtgggaaatg gctcttgtgt tctagtcaag acctctgcca 28860 
accagttctg acactgaccc agcacagaac ctgacaggtc agcaagggcc agggcttagc 28920 
acagcccagg taagggtgtg tgtacggccc ccagagtcac tcccaggctg caagaaaagg 28980 
gacaaaggag ggacaagggg tggccaagca aactgttccc tctgctcggg agtctggggt 29040 
gacctggcct agctggccag tggagctggg ccacctcccc ttaaactctc caccccggac 29100 
ttcgactcca aagctttcct gccacccacg ctctccccac ctgggatcac ggccaggccc 29160 
tgagccttca agggcccagg tgaactcagc cagactagga gctgaggagg acacagggca 29220 
gcttccagaa cggacccgag aaccactccc agcaggttct gcttccagac aaggagctgc 29280 
actttttcag ccaatgcaat tagaaagcca ggagaaggtg caaattccac ctgcctgagc 29340 
gtccgcactt cccaggccgc ccaccataca cacagcaaag atgtgtttaa ccattcaaac 29400 
ccatggccaa ccacatcggt tgcctcagac atgcaagttt taaaaaggaa cataactatg 29460 
ggccaggcac ggtggttcac gtctgtaatc ccagcacttt gggaggccga ggtgggtgga 29520 
tcacctgagg tcaggagttc gagaccagcc tagacaccat ggtgaaaccc catctgtacc 29580 
aaaactacaa aaattagctg ggcgtggtgg tgggcgcctg taatcccagc tacttgggaa 29640 
gctgaggcag gagaatcact tgaacccggg aggcgaaggt tgcagtgagc cgagattgtg 29700 
ccactgcact ccagcctggg caacaaggga gactccatct caattaaaaa aaaaaaaaaa 29760 
aaaaaggaac ataactatgg agtctcaagg ggaagtaatt ccttcaacaa taacaaatct 29820 
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tgaaagctga gctctttttt ttttttgaga caggatctcc tcactttgtc gcccaggctg 29880 
gagtgcagtg gtgggatcac agctcactgc agcctcgatc tcccaggctc aaatgatcct 29940 
cctacctcag cctcccaaga agctgggatt acaggtgcat accatcacac ccgattcatt 30000 
tttgtatact ttgaagagat ggggtctcac catgttgccc agtgtggtct tgaattcctg 30060 
gactcaggtg atctgcccgc cttggcctcc cagagtgctg ggattacagg cctgagccaa 30120 
cacccccacg ggttcatttt cagagtcgca ccgagtgctg gggttacagg cctgagccaa 30180 
cccccccacg ggttcatttt aagagtgaca ccgagtgctg gggttacagg cctgaaccaa 30240 
cccccccacg agttcatttt cagagtcgca ccgagtgctg gggttacagg cctgagccaa 30300 
cccccccacg ggttcatttt aagagtgaca ccgagtgctg gggttacagg cctgagccaa 30360 
cacccccacg ggttcatttt cagagtcaca ccgagtgctg gggttacagg cctgagccaa 30420 
cccccccacg ggttcatttt cagagtcaca ccctttttct gaaaaacaac ttgggctcat 30480 
gcaaattcga gagagagatg gtgacactcc ccgccccctg gacccaggtg gagtcgcagc 30540 
agggtttacc cgtgagcggg gtccaaggcg atggccctcg gctggtcaag gtcctgccag 30600 
aagagcacct tccgggatgt gccattgagg ttggccacct cgatgcggtt ggtctctgag 30660 
tccgtccagt acagcttctt gcccacccag tcgcaggcga ggccgtcggg agagaccagg 30720 
ccggagatga ccacgttctg cacggcggcc cccgtctggt tcaggtaggt ctgcttgatg 30780 
gcctcctcgc tcacgtctgt ccagtacacg gctcccttgg aaaactggaa gtccactgcg 30840 
gccgcatcct ccaggccgct gaccacgatg gtggactcca gcttgactcc gccggcgtcc 30900 
accagccgta cgtcccggcg gttggcaaat agcaggagcg gcgaggctgt ggggcagaag 30960 
caaaccgtga gggccactgg ctaagccagc aagatacaca gccctgggat ggagcactat 31020 
gcccagagca ctcctggtac tgccctgccc atgcccaaga cctccagttc cttcctccca 31080 
cccctaaggc gttgtcagga agttgcctgg gcagccccgg cccgcatcat tcagaggctc 31140 
ctgcagcgca gcaaacagcc ttcttcccac attcggtgac agcacctgtt tgtttaccaa 31200 
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ctgttacpc .gttccccca ga,a.ggg.g acccttccg ccatgcccaa ^ ^ 

a«cgtcc,cc agaggcaca ggggcccg, cc.gttc.gc agagaagcca ca.cccctt. 3I 320 
g«a cacagggga. ggg gaca.gc aggcacagca c.ggcca.gc .gc.cgc.ac 3Z380 
agacccagcc acagggccac attttttgag ggg^ ^ ^ 

aagafccc, te caa gtett tga cc actgt ^ 

ca,ca g c n ccca,ccacccc W a..cca. g «c,cccaccc t gc tt c,g g aca a cc 31560 
•acamaaa gggfcaac ggwgccac cccttggc[c ^ 

tctggcctca cctacacttc tctccaccac cccacccraa ♦ 

cccaccccaa gcattcctga tctgcctgcg 31740 

g ccag g «g g c.cc a cacc.ccc.g S cacc g ca g ccc t ca g cccc tt c. g cc.g. g ca 31800 
agaagccfca .ccacagac aa CggKtea „cccacaac g ggctC!l a tg agaaafcagg 31 860 

^^agacca.accaccagacacc.ca.acg.cg.accag.aggg.ccagc 31920 
aacccccaac acagactcag agggacaag aagc.aca.g aggag.gaac acaaga.g,g 31980 

Sacaggaggaggttaagggccccagggagc.cca^ccg.gttc.gc.g.cagcag 32040 
~* aaacaccccc aaaaaaca.c .gaagcc^g gcttgaaaca 32100 

gcgacattc cca^aaaa Cgcagaccc c.g ggtect c cgcgcaga. gggggagccc 321 60 

agccaaccccacac.ccc.ca.caccaagaaagagaaagccaaaacaaac^aaacag 32220 
ccaafcacaa .cacagaac, gaa^ ^ ^ 

«~ a^cagag catggcccag 32340 
gagaccccag cgcagtgggg tegat ^ cga cctccottrc 

s S BS cctcc«gcc aagacaa g ca 32400 

ctg g ctcaaa tcctggcccc teccattccc agga^acatt cteca,.,,, . 

ssa„acatg ctecacagga t ggg aggaca 32460 

cacagaggacctgaggccagg 

s 'S c ggcgcaccgccgccccacccgtgctgt 32520 
catena gpctacag, ^ 
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ggaaggcata ggccacaaat aaagacaaac gggacttcat gggaagctaa agattttgtg 32640 
catcaaaaga cactatcgag agagtaaaaa ggcaacccac agaatgagag aaaatatttc 32700 
caaatcatag atctactaag agattaatat ccatgaaata cagagaactc ctaaaactca 32760 
acaatgagaa aacaactaag ccaactcaaa aatgggcaaa caacttgaac agacatttct 32820 
ccaaagatga catataaatg gccaataaac acatcaaaac aggcttaata tatccctaat 32880 
catcagggaa atgcaaatca aaactacaat aagataccat cttgcaccaa ttaggacggc 32940 
tactatcaaa aaaacaaaat agcaagtgtt ggtgaggatc tggagcaact ggaacccttg 33000 
tgcaccactg gcaaaaatgt gaaatggtgc agctactatg gaaaacagca tggcagttcc 33060 
ccaaaaactt aaacacagaa ttaccatatg acccagcaat ttcgctttgg gttatatacc 33120 
caaaagaact gaaaacaggg acacaatcag atatgcatac accttggatc acagcagcat 33180 
ccttcccaac agctaaaaca tggaggcagc caggcatggt ggctcacgcc tgtaatccca 33240 
gcactttggg aggctgaggc gggtggatca cctgaggtca ggagttcgag accagcctgg 33300 
ccaacatggt gaaaccccgt ctctactaaa atacaaaaat tagctgggcg tagtgacggg 33360 
cacctgtaat cccagctact cacaagtctg aggcaggaga atcacttgaa ccctggaagt 33420 
ggacgttgca gtgagccaag attgcgccac tgcattccag cctgggtgac acagcgagac 33480 
tctgtctcaa aaaacagcaa aacaaaaaca aaaaaacaaa caaacatgga agcaacccaa 33540 
gcgtccctct actgagggat gaatagcggg gcaaaatctg ctccatccac acaatggagt 33600 
actattcagt ctcaaaaagg aaaaagattc tggtcaggca cggtggctca tgcctgtaat 33660 
cccagcactt ggggaggctg aggcgggtgg atcacctgaa gtcaggaatt caaggcccgc 33720 
ctggccaaga ctggcaccna gctacacana aagtatangg ccccggaaa 33769 

<210> 9 
<211> 72049 
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<212> DNA 
<213> Homo sapiens 

<220> 

<221> unsure 

<222> (8356),(8385),(38585) 

<223> Identity of nucleotide sequences at the above locations are unknown. 
<400> 9 

tataccttgc gcggaccttc ggctcctgtg gtgaagacaa tatgaagaaa atagaaatta 60 
cccataattt tgccacacag acttagttgt gtccatgtat cttgtgcacc ttttttctgt 120 
ttacggatca aaatcgactt ttagggtcag gcgcggtggc tcacacctgt aatcccaaca 180 
ctttgggagg ctggagttgg ggttgggggg tggatcactg aagatcagga gtttgagacc 240 
agcctggcca acatggcgaa actccatctc tactaaaaat aaaagattag ccaggcgtgg 300 
tggtgggtgc ctctaatccc agctactccg gaggctgagg caggagaatc gcttgaaccc 360 
aggagacaga ggttgcagtg agccaggatc acgccactgc actccagcct ggcaacagag 420 
cgagactctg tctcaaaaaa aaaaataaaa ataaaataaa taaatacata aattgacttt 480 
taggagattg gttcaaacaa tgtgtgtaat gttgtgtctg agtgtttttc atttatcgtt 540 
catgcaaatt ccgacatcat tcactcttct ccagagtgtg ctgttttcct gcctgtgtca 600 
tcacccgtca ccttgaatgc cctcgtttag gtaaaataag tacattttat tcaaaaatat 660 
ttgaggac^tttgggttgtct(xaggttcttggtcttgagttttgctgttcttgtggagc 720 
catggtggtg tctggttgca ggaacctcca tgcgttccag ctgctgcttc tgcctgtgtt 780 
cttagagagg aaatgctggg gtccgcggtt cccgggctgc tgaccaggaa gcctgcggtg 840 
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ctttacggcc cttccagaag cgggagatgc ccccacttaa gtgtcagaca ggcctttcca 900 
cctcactggc agctctgagc ggctcccttc tatttgcaga tgactgagaa gttaccaatt 960 
tccacgttta ctgactgctg tttctcctgt taatttgtat ttatagtctt cgctaattta 1020 
ttgctagggt tttggtgttg tccctattga cttgtatgcc ttttaatttt ttaaacaaca 1080 
ttaatatact tcattttttt agagcagttt taagtttaca ggaaaattaa gggacaagta 1 140 
cagagagttc cttccacctg ctgtcctcct ctcctcctcc ccaccttccc tccttcccct 1200 
attgtaactt tctttctgat attataaaag tcactcatgg ctgggcgtgg tggctcacgc 1260 
ctgtaatccc agcacgttgg gaggcagagg caggcagatc acctgaggte aggagttcca 1320 
gaccagcctg gccaacatgg tgaaaccccg tctctactaa aaacacaaaa agttagccag 1380 
gcgtggtggc gggcacctgt aatcccagct actcaggagg ctgaggcagg agaatggcgt 1440 
gaacctggga ggcagaggtt acagtgagtc gagatcgcgc cactgcactc cagcctgggc 1500 
aataagagtg aagcttcgtc tcaaaaacaa agtcacacac gcttcttgta cgagggtcat 1560 
ttggccgagg ggccagatgg ctcaccatct agttgggaca ggccatgagc tcggaatgct 1620 
ttttacatat ttacatggtt gagaagaaaa tcaggagaat aatgttttgg gacatgggaa 1680 
aatgacatgg aatttgcatt ttagtgtcca taaatgaagt tttgtttgct cccagctgtg 1740 
ttgactgagg caggctggct tcctacagct gcggcagagc tgaggaggcg ggaaggagac 1800 
cgtgcaggcc gcagcaccga aaatatttgc tctctggccc ttcccagagt gcttgccgac 1860 
ctctgtccga cagctagaag gaaggatagg acccgtccga cgataaccac tgttgacatt 1920 
tgagcgcgtt tccttcccgg cttttgtgtg agagtggcag tctgtttgct tttgtggtcg 1980 
ggatctgctg cacgcacggc gggctgtttg catgaggctt cctggaggat agggctgggc 2Q40 
tcggagctgc acgcagtggg gcgtgtcctg catgcagtgg ggcctcagaa gagagctgtg 2100 
gtgggcgggg cagtgccaac gctggtgggt gccaggcctc cacgctcaga tcagccccgg 2160 
cgacaggttt gggccaccxt ctctctggcc tctgtgcagt ggcccaggcc gtctgctctg 2220 
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cctggcacac ttgcctctgt ccttccactg aagcgctcct cttaccctct gctcccggct 2280 
gggtacgttg aattgtgtcc ctcaaggaga tatgctaaag gtctaacccc aggaacctgt 2340 
gtatgtgatc taatttggaa acagggtctt ggctgatgta atcaagcgag gatgaggtca 2400 
ccctagagta gggggcctat atccacggtg ctggtgtcct catgagagca ggtgagcaga 2460 
cactgacact caggggtgaa ggctgcatgg agtcagaaca gggcttagtg cgatggcggc 2520 
cacaagccaa ggaactccaa gtatttcctg caacaccaga agctggaaga tgccaggaag 2580 
gatcctgccc tggagccttc ggagggagtc tgtccctgca gacgtcttga cttttgattg 2640 
cagggatgca tgtcttaggg tgtgtggggg ggtgcatttc tgatgttaga agccacctgg 2700 
ttggtggcga tgtgtcacgg gagccctctg caggttctgc gtgtccatgt ggtcggggac 2760 
agaggtgggc agggacggac ggtgtcgagc tggacatgtc catgacgtcg gccatccctt 2820 
gggatggctt ttttgttttg aggataaggc tgcctgccag gaagctgtgc cctgcctggc 2880 
ccttgcccca agcccctggc ctgtgcttgg cctcgcggaa gggatgtcgc ccttctctcc 2940 
tgcatgcgtg cagggaggaa ggggagaggt cagcagcccg cctggaggag gctcgggcga 3000 
ggggaaggtt tcactttcag gcaatgttgt ggggctgttt aaacaacccc aaagaaaacc 3060 
atttggccaa actgttagtt tccaaacatt ttacttcctt ggtgtttaaa taaattccta 3120 
ccaagactct gtagctggtc ccagggaagg agttggcctc tcttctttat agcccggcac 3 180 
agtcagtccc ctgcacctgc ccctcccaac cccaggcctg cttccccgtg gccatggctg 3240 
ctgcccggac ctctctacac acagaacctc ctggaggcca gctgtgggca ccagccttgg 3300 . 
cagggctgtg gcggagccca ggctgctggt actctctctg cagctgctcc ctgctggcct 3360 
ggctggacag cgtccccacc accactgggg tcacctctgt gctggtcaca gctcactcag 3420 
accttcaggc aaatgggttg gatcctgcct ctctcccagg tgtctcagtc tctgcaaaac 3480 
tcaaaaacct cagaggcctt gcagcctgag gggtgtcaga gacacctcct tcgaatcagt 3540 
aaacacctac agattcaccc cagcagtgaa aggactgctt cgccacagag gtttgattta 3600 
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ctcctaagta attggaaggg atgccgagaa taggttcctc atggtgggac tagaggccct 3660 
ctgctgacct agttaacaga gggctagggc tgggtgtgct cagcccctga aggttctagg 3720 
cccatttggg acaccccgcc agaacctgcc acaacctgcc atgtggtgac agctacctaa 3780 
atcccagagg ctcttgagct ggagagcaga cctctcaatc tcagcaggcc ccccacacag 3840 
accccataac cctagtctgc cttcacagta cagttcgtgg ctatgtgttc acggatggtg 3900 
ttgttcacct aaggtctctg ccctgtgacc ccaagggcgt cctgagggca gattccaagt 3960 
ctgtttcgtc cacccctcct tccctagcag cgggtccagg gcctggcctg aactagcttc 4020 
ccacagagat actggtggga tgatgaaggc agccaggcgg caagtgaaaa acgcacttcc 4080 
tgcatgtgct ggctcctggg attgaagtgt ttgaggaagc aaagtgaagt gagctttcct 4140 
cttgcggctg tgtgtccttg ggccgggagc ctaccctctc tgagcgttgg ggtccttgtc 4200 
agtagaatgg ggcatcctca tagctcaagg ggtggtgtgt gaaaattgtg ctattgtgtt 4260 
actttaatga tttttttttt ttcgagacaa agtctcaccc caacgcgcag gctggagtgc 4320 
agtggcgcga tctcagctca ttgcaacctc tgcctcctgg gttcaagtga ttctcctgcc 4380 
tcagcctccc aagtagctgg aattacagga gtgcgccacc aggcccggca tatttttcta 4440 
tttttagtag agagggggtt ttaccatgtt ggctaggctg gtcttgaact cctgacctca 4500 
ggtgatccac ctgcctcggc ctcccaaagt gctgggatta caagcatgag ccaccgcgcc 4560 
cggcctactt tagtgatttc ttaggaggac agagggaacg ggctggcaag acaggcttgg 4620 
aatgtgtttt gggatcaagt gccggtttct gtctggcact ggcgttctct gtggggccat 4680 
gatggacaca ctgctgaggt caagcgtgat tcgtcttgcg ctgtgcctgg cagtctcatt 4740 
ggaaagttct gtagacatcg tgtggatggg gctcttcccg gccaagccct tggggacctt 4800 
ccaggactgt gatctcccca cagtggctgt taagcaggga cctttcgtga agtggagtct 4860 
ctggtcccct ccaagtcata gctagacagg gactcgggca tcgccaagcc tggctgatta 4920 
ttcactggat gaggagacag gcccagagag gggcaggaac ctgcccgagg tcacccagca 4980 
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ggccccagag gtttcggtct cggattctcc ctgctcatcc ctggatgtag tgctgctgtg 5040 
gatgtggttc tgtgctgggg gctgtggaga gcagggggct tgtgccagga ccccagtgag 5 100 
ggtggcgccc tcgccatgag gccgactgtt ggtatggggc ggccatccac tggggtgtgg 5 160 
ggaggaacag ctttcctgag gaggaggtgg cgggaggaac agcttccctg aggaggaggt 5220 
ggcggtgctg tgtgacctgg gccttgaagg acaggtccat tgtcaacaga acattttggg 5280 
agtggagcct agagggagaa aatttgttga aattcagatt cccctccccc taccaataca 5340 
caccaaatca gatgcccctg accagatcta aatttggctc tcagagattt ccattgtagc 5400 
tgggcacttg gggaaccttc taagtgctgc ctctgcctct ccccagcctg cctgcctcag 5460 
tttccccagc cctgggcccg tgtcgctgtt gccatcacgt gggcgccctc tagtggagga 5520 
atcagattat gcactccggg gcttggagca ggagtcagga ggggctcctg tctttccttg 5580 
aaacgttgga tgccgggatc ctggaacagt ctctgcattc ctcctggcga gaaccagagc 5640 
ctgggcacag gggaccatct gttgtttgaa ggctgcagcc tggcagggca ctcaggagat 5700 
ctggcagttg gctgcagggc caggtctagg ggccagggca tcagggaggc tctgggctgg 5760 
ttcagccccg ggcccctttg cagattgtga cctgggcccc tgtgcagggg catggccaca 5820 
ggatgctggg aggggtctct gaccctgacc ttcttggctc tgtgcatcct tgagaccaga 5880 
aaggtctgga acaaatgagt agacgatgcc ctaacctggg gagggagcca catcctgatc 5940 
ccagcaacct cgggaaggat ctgtcaggat tatggggcac cctgggggcc ccaagtctgc 6000 
atgggtctcc acttgcaatt tctgtaggaa gctctgataa atccaaactg ggggtcctag 6060 
gacacagtca gaaatgctga taccgttgtg tgtggagcct cgggccctgg gggtcaggag 6120 
catgtggagg gtgggccacg ggggttcaga agagaatcct gtaacccccc accccccaaa 6180 
ctgaagccca cttgagggcc atggctgaaa ggttgggggg tctccgtgcg tcctgtggag 6240 
tgggtggtga ggagtccttg ggtttgcacg cctctgggcc tgagcggcgg gaccccgtcc 6300 
acagcggatc cctgggccct gttgctcaga tgctctcaga_gtgttgctgt ggccacggag 6360 
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ggagcctgag ttaagcttct cttgtgccgg ttgtacgctg tcaggtcaca ctggtgagtt 6420 
aggcagggca cagatgccca gagcagaggg aactttcctt ggggattcaa cacgtgcaag 6480 
tcttaggggc tggcaaatcc tgccctcagc tagagagggg gcttttattt gagaccagaa 6540 
tcacctgagc atcctcctgt ccccagctgt gtccagcctg tctgcaggga catcctgaga 6600 
ggaccaggct ctcccctcat ccacctgcct aagtgccact ctgaaccctg tccacctgtg 6660 
ccgtggaggg gcgtgacctc aagctgctca gccagcagca ggcttggccc tggggggcag 6720 
cagagaccca ggtggctgtg gggtgggtgc ttcgtggcgt ggttctgaaa cttcgttgga 6780 
agtgtgtgga cagtgccttg cctgttctct gtgggaccct atttagaaac gaggtctgag 6840 
ttactggggg tcatcactgt gttctgatgg cccagctgtg tggaggccgc ggtgcagccc 6900 
catccaagga gccagggccc tgggtctagc cgtgaccaga atgcatgccc cggaggtgtt 6960 
tctcatctcg cacctgtgtt gcctggtgtg tcaagtggtc gtgaaactct gtgttagctc 7020 
ttggtgttcc tgaaagtgcc cccgggtctc aggcctcaga accagggttt cccttcatct 7080 
cggtggcctg ggagcatctg ggcagttgag caaagagggc gattcacttg aaggatgtgt 7140 
ctggccctgc ctaggagccc cccggcacgg tgctggggcc tgaagctgcc ctcgggtggt 7200 
ggagaggagg gagcgatgaa gtggcgtcga gctgggcagg aagggtgagc ccctgcaagg 7260 
tgggcatgct ggggacgctg agcagcatgg ccagcagctg ggtctgcagc ctggtacccg 7320 
gcgggacttg tggttggggc tggtttgtgg ccaggagagg ggctggcagg agacaagggg 7380 
gactgtgagg cagctcccac ccagcagctg aagcccaatg gcctggctgt gtggctctca 7440 
gctgcgtgca taacctctca gtgcttcagt tctctcattt gtaaaatgag gaaacaaaca 7500 
gtgccagcct cccagaggtg tcatgaggat gaacgagtga ccatgtagca tgggctgggt 7560 
gcgtgtcacc taacatcacc agcctttgca aggagagccc tgggggcctg gctgagtatt 7620 
tcccttgccc ggcccacccc aggcctagac ttgtgcctgc tgcaggccct tgacccctga 7680 
ccccattgca cctgtctcca caggagccga ggaggtgctg ctgctggccc ggcggacgga 7740 
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cctacggagg a>c<cgc.gg acacgccgga cttcaccgac atcgtgctgc ^ 

catccggcacgccattgccatcgacacgacccgctagagggcatgectaaggaoaga 7860 
tgacgaggtg cgggcca.cc gcagggcgta cctggacggg .Cggggcg, agacgMggt 7920 

caacaccgag atcaacgacc ccgatggca, cgcggtcgac tgggtggccc gaaacctcta 7980 
ctggaccgac acgggcacgg accgcatega ggtgacgcgc ccaacggca cctcccgcaa 8040 
gatcctggtg tcggaggacc tggacgagcc ccgagccatc gcactgcacc ccgtgatggg 8I00 
gtaagacggg cgggggcgg ggcc.gga g c cagggccagg ccaagcacag gcgagaggga 8160 
gattgacctg gacctgtca. .ctgggacac .g^gca. cagaacccgg aggagggct. 8220 
gttaaaacac cggcagctgg gccccacccc cagagcgg.g ancaggagc .ccagggcgg 8280 
SSCgaagac aacaagcacc ccag.gg.cc ggtgcgcg cggpcca. 8340 

gcgtagaaagccc.gnaaac^gagggagccc tt ,g te cc CC ,g Ilcttcagtttcctcat ^ 
ctgtagaatg gaacggteca tc.ggg.ga, ttccaggatg ^ 

agcccg^acaagaccacagacaggc caggcc.cn awctttttrnntgag 8520 
a.ggag.«cac»g,cgcccaggc t ggag.g M g t gg tg%atctcagcteactat;aa ^ 

ccogccc ogggccaa gtgattacc .gcacagcc .cctg^g asggam , 8640 
agglgcctgc^pgcngge^^^^^^ ^ 

««tggcc ag gaggtcgca aac.cc.g.c «ca gg , gat ^ 

-g«gc*gg attacaggca .gagcccca cgcccggfca ggccaggcc, cm.g.aca 8820 
cmgcacac catgggM. ttcatccagg ggggtagga ^ 

otgaagccca gagaggcto gggacttgcc cagggfcaca cagcagga«g « ggcaggtgt s940 
g gggc.gggcag g ca g cg.gg C ,ca gctttccagcatagaaatctgtgaaagcagaa ^ 

<>««» /WW, gactttotga gacccgcccc agcggcKag 9ofio 

ccaggggcc. tecgggggc .^ccca gaacacgaa .gggaaaacc ctga.ggagg 9120 
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aggcgcagtg gagctgtggg tgccgatggg aagtcccaga ggagctggga ggtcagtagc 9180 
ggtgctgccc tctgtggagc acttagtggg caccaggtgt gtttccaggt tcatggccct 9240 
gggacctgaa gctcagaagg tgaagtaact tgcccagggc acccgtcggg cagcggcggg 9300 
cagaggattt gtgggctgtg gagcctgtgc tcgtggccca gccctggggg ttgtgagtgt 9360 
gctggccggg gagcttttcc tgcaagtgga ctggtgtcta ggagccagca tgtcaggcag 9420 
caggcagcgg gagtgcagca ggcagcggga gcacagcagg cagagggcgg ggctcgagca 9480 
gccatccgtg gaccctgggg cacggaggca tgtgggagag ggctgctcca tggcagtggc 9540 
tgaagggctg ggttgtgccc cgaggagggt ggatgagggt aagaagtggg gtccccaggg 9600 
gctttagcaa gaggaggccc aggaactggt tgccagctac agtgaaggga acacggccct 9660 
gaggtcagga gcttggtcaa gtcactgtct acatgggcct cggtgtcctc atctgtgaaa 9720 
aaggaaggga tggggaagct gactccaagg cccctcctag ccctggtttc atgagtctga 9780 
ggatcccagg gacatgggct tggcagtctg acctgtgagg tcgtggggtc cagggagggg 9840 
caccgagctg gaagcgggag gcagaggggc tggccggctg ggtcagacac agctgaagca 9900 
gaggctgtga cttggggcct cagaaccttc acccctgagc tgccacccca ggatctgggt 9960 
tccctccttg gggggcccca gggaacaagt cacctgtcct ttgcataggg gagcccttca 10020 
gctatgtgca gaaggttctg ctctgcccct tcctccctct aggtgctcag ctcctccagc 10080 
ccactagtca gatgtgaggc tgccccagac cctgggcagg gtcatttctg tccactgacc 10140 
tttgggatgg gagatgagct cttggcccct gagagtccaa gggctggtgt ggtgaaaccc 10200 
gcacagggtg gaagtgggca tccctgtccc aggggagccc ccagggactc tggtcactgg 10260 
gcttgccgct ggcatgctca gtcctccagc acttactgac accagcatct actgacacca 10320 
acatttacaa acaccgacat tgaccgacac cgacatttac cgacactgac atttaccaac 10380 
actgtttacc aacactgaca tctactgaca ctggcatcta ccaacactga catttaccga 10440 
cactgacatt taccaacact atttaccaac actgacatct actgacattg gcatctacca 10500 
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acaccaacat ttaccgacac caacatttac caacactgaa atttaccgac accgacattt 10560 
accgacaccg tttaccaaca ccgacgttta ccgacaccga catttaccga cactgatatt 10620 
taccaacact gacatctact gacgctggca tctactgaca ccgatgccag catctaccaa 10680 
caccgacatt taccaacact gacatttacc aacactgaca tttaccgaca ttgacattta 10740 
ctgacactga catctactga cactggcatc tactgacact gacgtttacc gacactagca 10800 
tctactgaca ctgacattta ccaacaccag catctaccaa caccgaeatt taccaacact 10860 
gacatttact gacactgata tctactgaca ctggcatcta ctgacaccaa catttaccaa 10920 
caccagcatc taccaacacc gacatttacc aacaccagca tttaccaaca ccgatgttta 10980 
ccaacgccga cgtttaccga cgccagcatc taccaacact gacatttacc gacaccgaca 11040 
tttaccgaca ctgacattta ctgacactga catctactga tactggcatc taccgacact 11100 
gatatttaccaacgccagca tctactgaca ctgatgttta ccaacaccga catttacgag U160 
caccgacatt tactgacacc aatatttact gacatcaaca tttagccatg tgatgggggc 11220 
cggcttgggg gcaggccttg ctcttggcac t ggggatgct gcagagacca gacagactca ^ 

tggggtcatggacttctgcttcttctccagcctcatgtactggacagactggggagagaa 11340 
ccctaaaatc gagtgtgcca acttggatgg gcaggagcgg cgtgtgctgg tcaatgcctc 11400 
cctcgggtgg cccaacggcc tggccctgga cctgcaggag gggaagctct actggggaga 11460 
cgccaagaca gacaagatcg aggtgaggct cctgtggaca tgtttgatcc aggaggccag H520 
gcccagccac cccctgcagc cagatgtacg tattggcgag gcaccgatgg gtgcctgtgc 11580 
tctgctattt ggccacatgg aatgcttgag aaaatagtta caatactttc tgacaaaaac 11640 
gccttgagag ggtagcgcta tacaacgtcc tgi^ttacg taagatgtta tcattcggcc 11700 
aggtgcctgt agacacagct acttggagac tgaggtggga ggatcgctgg agtccaagag 1 1760 
tttgaggcca gcccgggcaa aggggacaca ggaatcctct gcactgcttt tgccacttac H820 
tgtgagattt aaattatttc acaatacaaa attaagacaa aaagttaatc acatatccac H880 
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tgccctgctt aagacagaaa acatgggtgt tgttgaagcc agaggcagct gctggcctga 1 1940 
gtttggtgat tggttcctaa gcagttgaag gcagttttgt ttttccatag atgtctgttc 12000 
tccctttgct gggtgcagcc tcgccctgct gctgtggtcg ggtttcagtg gcctcgtccc 12060 
gtggacgcag cctcgccctg ccgctgtggt cgggtttcag tggcctcgtc ccgtggacgc 12120 
agcctcgccc tgccgctgtg gtcgggtttc agtggcctcg tcccgtggac gcagcctcgc 12180 
cctgccgctg tggtcgggtt tcagtggcct cgtcccgtgg acgcagcctc gccctgccgc 12240 
tgtggtcggg tttcagtggc ctcgtcctgt ggacgcagcc tcgccctgcc gctgtggtcg 12300 
ggtttcagtg gcctcgtccc atgggcgtgc tttggcagct ttttgctcac ctgtggagcc 12360 
tctcttgagc ttttttgttt gttgtttgtt tttgtttgat tttgtttgat tgtttgtttt 12420 
tgttgtcgtt gttgttgccc aggctggagt gcagtggcgc gatctcagct cactgaaacc 12480 
tctgcctcct tgggttcatg ccattctcct gcctcagcct cccacatagc tgggattaca 12540 
agtgcccgcc accacgcctg gctaaatttt gtatttttag tagacagggg gtttcaccat 12600 
gttggtcagg ctggtctgga actcctggtc tcacatgatc cacctgcctc ggcctcccaa 12660 
agtgttggga ttacaggcgt gagccaccgc gcccagccct ctgttgagca tattttgagg 12720 
ttctcttggt gccagtgata tgtacatgtg tccccatcgc accatcgtca cccattgagg 12780 
tgacattggt gcctctcctc ggggtggatg cctccctctg tttccagcaa cttctgaagg 12840 
attttcctga gctgcatcag tccttgttga cgtcaccatc ggggtcacct ttgctctcct 12900 
cagggctccc aggggaggcc cgaatcaggc agcttgcagg gcagggcagg atggagaaca 12960 
cgagtgtgtg tctgtgttgc aggatttcag accctgcttc tgagcgggag gagtttcagc 13020 
accttcaggg tggggaaccc agggatgggg gaggctgagt ggacgccctt cccacgaaaa 13080 
ccctaggagc tgcaggtgtg gccatttcct gctggagctc cttgtaaatg ttttgttttt 13140 
ggcaaggccc atgtttgcgg gccgctgagg atgatttgcc ttcacgcatc cccgctaccc 13200 
gtgggagcag gtcagggact cgcgtgtctg tggcacacca ggcctgtgac aggcgttgtt 13260 
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ccatgtactg tctcagcagt ggttttcttg agacagggtc tcgctcgctc acccaggcga 13320 
gagtgcagtg gcgcaatcac ggctcgctgt agcctcaatc tccctgggct caggtgatcc 13380 
tcctgcctca ccctctgagt agctgggact acagacacat accaccacac ccagctagtt 13440 
tttgtgtatt ttttgtgggg ggagatgggg tttcgctgtg gtgcccaagc tgatctcaaa 13500 
ctcctgaggc acaagcgatc cacctgcctc ggcctcccaa agtgctggga tgacaggcat 13560 
cagccgtcac acgcagctca atgattttat tgtggtaaaa taaacatagc acaaaattga 13620 
tgattttaac cattttaaag tgaacagttc aggctgggcg tggtggctta tgcttgtaat 13680 
cccagtactt tgagaggctg aggtgggcag atcacctgag gtcaggagtt tgagaccagc 13740 
ctggccaaca tgatgaaatc cagtctctac taaaaataca aaaattagcc gggcatggtg 1 3 800 
gcaggtgcct gtaatcccag ctactcggga ggctgaggca ggagaatcgc ttgagcccgg 13860 
gaggtggagg ttgcagtgat ctgagatcat gccactgcac tccaatctgt gtgacagagc 13920 
aagactctgt cttgaaaaat aaataaataa aaaaaatttt aaaaagtgaa caattcaggg 13980 
catttagtat gaggacaatg tggtgcaggt atctctgcta ctatctactt ctagaacact 14040 
ttcttctgcc ctgaaggaaa ccccatgccc accggcactc acgcccattc tcccctctct 14100 
cccagcctct gtcaaccact aatctacttt ctgtctctgg gggttcactt cttctggacg 14160 
ttttgtgtga ctggaatcct gcaatatgtg gtccctgcgt gtggcttctt tccatagcat 14220 
tgtgttttcc agattcaccc acacattgtc gcacgttatc agaatctcat tcctgactgg 14280 
gtgcagtggg ttaggcctgt aatcctaaca ttctgggagg ccaaggcggg acgatcactt 14340 
gaggcaggag tttgagacca gcctggccag cctagcaaga ccccagctac caaaaaattt 14400 
taaaagttaa ctgaacgtgg tggtggtggg cacttgtggt tcccagctac ctgggaggct 14460 
gaggttggag gatcgcttaa gcccaggagg tcaaggctgc agtgagctat gatcgcacca 14520 
ctgcactcca gcctggacaa cagagcaaga ccctgtctga aaaaaaaaac aaaaaaaaaa 14580 
gttcctttct ttttgtggct ggatgacatc ccattgtatg gccacagcac attttgtttg 14640 
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tetgtttate gggtggtggg cagtggtttc caccttttgt ctcctgtgaa taatgctgct 14700 
gtgaacattt gaattcaagt ttttgtttga acacctgttg tgaattattt ggatatatgt 14760 
gtaggggtag gattgctgag tcctatggta atgttaggtt tgacttactg aggaaccatt 14820 
aaactgtttt caacagtggc tgcgccgttc tgcatcccca ccggcagtgt gtgagggttc 14880 
tgactttacc tcctcacaaa cgcttctttt ccatttaaaa aaatattcag ccaggtgctc 14940 
tggctcacgc ctgtaatccc agcactttgg gaggccgtgg cgggcggatc acctgaggtc 15000 
aggagttcga gacgagcctg gccaacatgg tgtaacccca tctctaccaa aaatataaaa 15060 
attagccggg tgtggcagcg ggcgcctgta atcccagcta cttgggaggc tgaggcagga 15 120 
gaatcacttg aacccgggag gcagaggttg cagtgagcca agatcgcgcc actacactcc 15180 
agcctgggtg acaagagtga aactccatct aaaataaaac aaaaataaaa ataaataaaa 15240 
atttattaaa acattcatca cagccagcct agtgggtgtc ccatgtggct ttgcctcgca 15300 
tttccctgat aactaggatg ctgagcgtct tgtcccaggc ttgccacacc tcagcacttt 15360 
gagatacgtc gcacagtccc catttgcgaa cgagaaatga ggtttaggga acagcagctg 15420 
tgtcatgtca cacagcgagc agggggtctc tgagccgtct gaccccacag ccgaccaagc 15480 
tccaatcctt accgcctcct agtgttgtgg atgtagccca gggtgctccc acatttttca 15540 
gatgagaaca ccgaagctca aaacaggagc gttttgtcca cattggatac acgatgtctg 15600 
tggtttggtc ctgaagtcac tttatatctc agtggtccag actggagtag gacagggggt 15660 
tctggggaat ggggaaggtg tctcaggtga aaggaaggaa ttccagattc tccatactgt 15720 
ccttgggaag ttagaagact cagagggtct ggcaaagtca gacaaagcaa gagaaatgca 15780 
gtcaggagga agcggagctg tccaggaaca ggggggtcgc aggagctcac ccccaggaac 15840 
tacacttgct ggggccttcg tgtcacaatg acgtgagcac tgcgtgttga ttacccactt 15900 . 
tttttttttt tttgaggtgg agtctcgctc tcttgcccag tctggagtgc agtggcacga 15960 
tctcggctca ctgcaagctc tgcctcccgg gttcatgcca ttctcctgcc tcagcctccc 16020 
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gcgtagctgg gactacaggc gcctgccacc gcgcccggct aatttttgta tttttagtag 16080 
agatgggatt tcactacatt agccaggatg gtctcgatct cctgacctca tgatccgccc 16140 
gtctcggcct cccaaagtgc tgggattaca ggcgtgagcc accgcgcccg gcccgatttc 16200 
ccactttaag aatctgtctg tacatcctca aagccctata cacagtgctg ggttgctata - 16260 
gggaatatga ggcttacagg ccatggtgct ggacacacag aagggacgga ggtcaggagg 16320 
tagaagggcg gagagaggga acaggcggag gtcacatcct tggctttcaa aatgggccag 16380 
ggagagacac cctctgagca tggtaggaca ggaaagcaag attggaacac attgagagca 16440 
accgaggtgg ctgggcgtgg tggcttacgc ctgtaatccc aacactttgg aaagctgagg 16500 
tgggtggatt gcttgaggcc aggagttcaa gaccagcctg gccaacatgg tgagaccccg 16560 
tctctactaa atatacaaaa attagccagg cgtgatggtg catacctgta atcccagctg 16620 
cttgggaggc tgaggcagga gaattgctta aacctgggag gcggaggttg cagtgagccg ■ 16680 
agatcccgcc actgcactcc agcctgggcc acagagtgag actccatctc aaaaaaaaaa 16740 
aaaaaaaaga taaaaagacc aaccgaggaa ttgaagtggg ggggcgtcac agtagcagaa 16800 
gggggatcgt ggagcaggcc accctgtggt catgcactgg aagctcatta cctgacgatt 16860 
tggagctcat cactgggggc ctaaggagaa tagatactga aggatgagga gtgatggcgc 16920 
ggggcacggg tgtctttggt ggccagaact tggggactgc tggggtgcct cactgcaggc 16980 
cttctcagcg ccctttatat gcttacacag gctgtttcta agagggggat acattgcata 17040 
agcgttttca gactacctca tcatgggtcc ctttctttac cctctgtggc cctggtggcg 17100 
cactctctgg gaaggtgcag gtggatgccc agacccgccc tgccatccac ctgcacgtcc 17160 
agagctgact tagcctcgag attgctgctg gcacctcctg ccccgggaca cctcggatgt 17220 
gcccgtggag atgctggctc tgtgttttct gctggagttt ggtgcgtctt ttcctcctgc 17280 
aagtggccac cgctcttggg tatgtcctca ggcttctgcg agtcatggct gcttctcagg 17340 
tccttgccca gcgccaggag caaaccctcc tggcactttg ttcaggggtg gatgcgccag 17400 
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tgttcctgct gtggaccgcc atctcacatg agggtcttgg gcctgcaggc tcgttcagga 17460 
aacacccgct gagtatgcag tgtgtgccag ctgtgtccca ggcaatggcg gggacagtgg 17520 
ctgctgctgg ggttgtggtg gcttctgggg actctgggga cagctgaggt gcaaggagcc 17580 
acggctcctt gaggatgcag ttggactcca ggtggaaggg atggttgggg gaggtataaa 17640 
tggggtcagg gaggagacac atttggaaca atgggaacat ttttaagatg ctatgtcggg 17700 
aggcaacaag gtggccaacc caggtgctga ggagcccaca ccagccctgg acgtgttttg 17760 
ccgctcacct ttgctgggga gtggtgggag agaggattcc gttccacgtg gtggtgtgcg 17820 
cagctgggct gtgtggagct gggcgctagg aggaaggtgc tttctgcggg gctagccggg 17880 
ctctgccttt gaacacaatc aggctccagg ttttcagcat ccagtgcatg agaggacttc 17940 
acgggcagct gtggctgatc ccttgatgaa ttgggagaag aacaaaggtc tatgaaatga 18000 
ggtttcatgt agatggcatt agagacgccc acaacagatt tacagagtgg agcggagacg 18060 
gcggatgggt ctgggaggcc cctcctgctg gccttgactg tgacagctgt cctgggaatc 18120 
agcttccagg ccgccccagc agcctgactg acacacacag gggttttagc cccatcctgc 18180 
gaccagctgt tgccatcatc agtgacagct gggagtggcg gtggttccag ccctgggcac 18240 
cctccccacc tgctggggcc cacccagggc agtcctgaca cctacaggtt gcttggagcc 18300 
gcatccgagt cctgccccac cacgtgtgaa gcccgagtgg tcgtgggctg aggtcccctg 18360 
attgcatccc cacttccctt ctgcttcaca tagctgcctc ttctcaccgt ttttccagcc 18420 
tcctgggcta ggaattccag tgttgtgctg gctttgcccc aggacacctc cttagccctc 18480 
ttcctgagtc tagagccccg ggggttggaa gtcctggccc ctgggacacc tgcagccaca 18540 
ctcagcttct cctgtgagcc tccagcatgt cccctcagga ccaagccctc acgttcttgc 18600 
ctccccgccc acctgggctc agccagggga aggcctggct gggagcgtct cccctctgcc 18660 
ctgcccttct cccctcctac cctgcccttc tctcctctgc cccgccatgg cttttatatc 18720 
ctgtgccaca agacatggct gtgtgtgaaa gtggcagggt ctggcatctc tgtgggtctc 18780 
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tgaggcccac gctccagtgc cactcttccc acccgctggc cgtgccctca tgctggaggg 18840 
acagcccagc cctctcccga accccagccc catgtgccca gctgcccccg gccctctccc 18900 
ctggaagccg gggtcactcc agccgtatgc catggtgggg acatcctgct tccttggcct 18960 
tccagggaag gtcctctttc caaatggcga cacctggtcc ctgcctggag gctggaagct 19020 
gtggcccttg tatgcccctc cagggtctgt gcgctcggtt ggcccgagtt cccatcaccg 19080 
tcatcatcac catcatcatt gtcatttcgc ttgtctgtga gccggcctgg tctcccagag 19140 
cagagaccct ctgaggtcca gcctgagttg gggtctccgt gctgacccct gacggggact 19200 
caggacgtac caggtctggg tcaggagtga cccccaaacc tcgtgccctt tgacaggcac 19260 
ccctgacttt tgctaagtgg gtggaggtga catcacttac agcgggagtg atgggacagg 19320 
gtctgttggc tgcactgtgc tcccagggat ctggggagag gctatatccc tgggctttgg 193 80 
cactgcagag ctgtgtgtgt ttgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 19440 
gtgtgtgttt gcgtgcgcgc acatgtgtat aagatctttt tttattacat gaagcaagat 19500 
aactgttgct gtttcctttt gggttttgtg ttcaacagag tggggtactt cttccctcag 19560 
acaacagaac tctccccttt aaacacgtgc tgtcagaggg tgggtcttgg gctcatgtct 19620 
gtttgcacag ccgagtcaga ggaaacacag ggttcttcat aaaaacactg cacagcaggc 19680 
gactgtccag agtcagcctg caggacggca gcagccctgc ccctcagagc acagctaggg 19740 
tgggctgctt tgggatctcc cgtcattccc tcccagctgg cagccggcgg ccggcccatt 19800 
ccttggtgtg ctggtcaggg gggcgtgcgc ctgctctgct caccctggga atgggacaga 19860 
agctggcagc tcggagagga cagggctgga cccttgggtg gcctctggct ggaccatctc 19920 
attgtcctca gacacagcct ctcgggtcta gtttcatttc ctgaaaaaca agtgcacaga 19980 
actagagcag gagtcgagag ctacggcccc cgggccagat ccagccctgc cacctgtttt 20040 
cacaccatgc tcaagctgag tgggttttac attttttaat tacttgaaaa aaaaaaagcc 20100 
aaaggaggtt tcatgaccca tgaaaattat atggaattca aaaaaaaaaa attatatgga 20160 
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attcaaattt cagtgtccat aaataatttc ttgagacagg gtctcgctct gtcacccagg 20220 
ctggagtgca gtgctatggc atggctcgct gtacccttga cctcccaggc tcaagcgatc 20280 
ctcctgtctc agcctcctga gtagctggga ctacgggtgt gtgccaccaa gcccggctaa 20340 
ttttttttta attttagtaa agacagggtc tttctatgtt gcccaggctt ttctggaact 20400 
ccatcttggc ctcccaaagt gctgggatta caggctcgag ccacggagcc cagcctgttt 20460 
ttgttttttc actgataaag ttttgccggg tgtggtagtg tgtgcctcta gcgatttggg 20520 
aggctgaggt gggaggatcg cttaagccca ggagtttgag gctgggctca agtgatcagg 20580 
aggtgaacta tgatcatgtc attgcattcc agcctgggtg acagagcaag aacctatctc 20640 
ttaaaaatat atatttaaaa agtattgggt gtggtggctc acgcctgtgg tcccagctac 20700 
ttaggcatct gaggtgggag gatggcttga gcccaggagt ttgaggttgc agcgagccaa 20760 
gatcgtgtca ctacactcta gcctgggtga cagagcccag accctgcctc tttaaaaaaa 20820 
aaaaccaaaa aacatgtatt ggaacacagc catgcctgtt cagtcacgtg ctctccatgc 20880 
tgctttctgc tccagagacc cttatggcct gaaagctgaa aatattttct atcctttaca 20940 
aaaaagtttg ctgacctctg tcctggaaaa ttcatctccc aagttctctt ccggcactgg 21000 
cgttcctggg tgtcctaaat ttggcccctg ttatttctga actctgtttt ggctctgttc 21060 
cctcccagga gccaggacag gcacgttctc tgcatcttgt cccctgacgc ccagaggctt 21 120 
ggctcggctc aggcattctt ggaaatatct ggctccagga aaggcagagg cctcctgagt 21 180 
cggcccagag ggaacctgcc ccaggtctgg gggaggcctg acccagcaga gtggcttttg 21240 
ccgatgggtt gggccggtca agatgtgctg aaagttgtcc tcagaaggcc actttgggat 21300 
tccttcctcc agtattagag caactgagag ctgctcattg caagcctgat gttttcccag 21360 
ttggccgggt ccaccgggtg ccctgggatt ctgggatctg ggtggaaagt agggggcttg 21420 
ggggagtgtc ctgggttctg gaatccaggt ggcaagtggt gaggttcagg gagtggcttc 21480 
tgagccacca taggggtctc tgtgggaggc tctgcccatc caggagattc cgcaggccct 21540 
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gccggcocag agccagcgtc ttgcgcttgc cgaggctaca gcoagcccca gccgggtgga 21600 
acagcccgtc gcctcctctc actttgtttt g gggC cacct gggagtgtgg agcaagggta 21660 
gagagggagg aagtggctgc cggccgctgc ccagcaccct tgtttgcctt gggccctctg 21720 
tgggctcctt tttattgctc ttcaatgaag ccagggaaat ggacttcctt gcctcacttc 21780 
agttcaacat gtctggaagt ttggtattaa aattaagaaa gtgtggaaat agagcaagaa 21840 
gagaaaaatc tctccaagag ataatagtga cctctgagct gggcgcggtg gctcacgcct 21900 
gtaaatccca gtactttggg aggctgaggc gggcagatca cctgaggtcg ggagtttgtg 21960 
accggcctga ccaagatgga gaaaccccgt ctctactaaa aataaataaa taaataaata 22020 
aataaataca aaattagcca ggcatggtgg cgcctgccta taatcccagc taaggcagga 22080 
gaatcgcttg aacctgggag gcaaaggttg cagtgagcca agatcacgcc attgcactct 22140 
agtctgggca acaagagtga aactccgtct caaaaaaaat aaataaataa aaaataaaaa 22200 
tagtgacctc tggccaggtg tggcagctca tacccgtaat cccagcactt tggaaggaag 22260 
gccgagatgg gcagattgct ttagcacagg agtttgagac cagcctggcc aacatggtgg 22320 
aaccccatct ctacaaaaat agaataaaat ttaagaggta atagtgacct tttggtagat 22380 
cgaaacctgg attgctttct ttttctaaat gctgattctt ttctttgtgg tgtttgtgtt 22440 
ctgtgccgat gtccctcccc cagccctgtt attgtgagtg gaagaagggg aaagggttcg 22500 
cccgctactg tgagcccctc ctctcacgct gggtgtcctt ggagaagcct gcacttcttc 22560 
attgtacgccagggctgggtccctccctggagtggttctgtg^^ 22620 
cccctcagat gttttctgag tgtcacacac aggtgtgtgc attcatggcc tttgcgtgtc 22680 
ttcctgttgt ggaggcaaaa atgtgaagaa ccctagatga ttttgggacc agggctccat 22740 
cacctgctgt tcattgcaca ccggagcatc caggcatggg tggagagctc agacttccag 22800 
gcacggtcgc aggggctggt ctaaccatgt tcccgcccgc ctgctcgtca gaaccgcctg 22860 
ttgggagctg ttatcatgat accatacctg ggccctgggc tatccgattc tgacttaatt 22920 
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gctccaggtt ggggccaggc cgttgtttgc tgttttgttg tttcttctgt gacgttagcc 22980 
actgggctaa tctgagcccc tcagttacag gtggagaaac tgagacccat gggggtgcaa 23040 
ggacttgccg aggacccaga gccccttggg ggcagagctg aggcggggcc tggctttggg 23100 
tcccagagct tccagtcccc ttcccgctct cctaacagct tttttttttg agacaagatc 23 160 
tcaccctgtc acccaggctg gagtgcaatg gcatgatctc ggctcactgc aatcttcgct 23220 
agctgcgttc cagcgattct cctgcctcag cctcccgagc agctgggatt acaggtgtgt 23280 
gccgccatgc ccagctcgtt tttttttgta cttttagtag agatagggtt tcaccatgtt 23340 
ggccaggctg atctcgaact cctgacctca aatgatccgc ctgcctcggc ctcccaaagt 23400 
gctaggatta caggctggga tcacactgtg cctggcccta gcagctttgt cctgtgccat 23460 
ccaacaacag atgaccgaag tctttgtttc ttaacatgca ttccatctgc cttacagttt 23520 
tgccacctgc aaaacagagg acttgtcgct tttctggtaa gctggaaatg taatctggta 23580 
gcaggaggcc. tgtggaagct tgcctttaat ggccttgtgt ctctttcatc ctgtcctgag 23640 
agccggagaa cttggatgtt gcacctaact caaccttcct gttaacatac agttctgcag 23700 
gctcatggat catcagaacc acgtcctatc tcacgcggct gtatgcttcc gttggttcag 23760 
gtgtttttac cttgacagta ttttctcctc ggtggctttt gcggtggttg cttttaatca 23820 
gcattgactc ttcaagaaaa atatttagct gctacatctc agaggagaca gggtggaaag 23880 
catctgagac ctgcaggctc agacttagaa ccagaagtgc cctcagagtt catccggccc 23940 
tgacccagcg ggaaatgagt tcacagagaa gcgggagaac tttgccccag gccctgccgt 24000 
tgctcataac tgccccaggt ccttacattt gctccaggtc ctgccccagg ccctgcagtt 24060 
gctcataact gccccaggtc cttatatttg ctccaggtcc tgccccaggt cctgcagttg 24120 
ctctgtgtgg tgggtgtgat ctggagccct ccgcccattg ctgcacctgg ggcaggcatt 24180 
gctaattgat cccaggactc cttcctgcgg agcacgccct ggttctccag gcagccgctg 24240 
cctgtcagcc tgcagtggtt cgggagagga cacctgcttg cctggtctgt tccaaatctt 24300 
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•UWme agoatocagg t g Cg g C a g gg aC a ggC c tgg ggclcgcggg cagggactc( ^ 

g gggteC oao actgcacctg ctt g tca g a g g cac<ca gt c aa tttttg c t 25800 
gatgaaggat gagagga^g aggacgtga. gottgctgot gca ttg cc.g oag.cc.ggg 25860 
'gagatgccc gggttgactc tgctgcccg, cgggtggacg , gatgtcaga tccccggct( ^ 
•aaaatacga ggg a g ctggg mafm& gcaggnggg ^ 

gaagcctgga gcgaggcag ,g,gg gcgac ccctggagca gtgagtga( ^ 
cttcacg^ ccctgcagtc ctcatgagg ggatgccatc catgaaffla ^ 
cc.oc.ttaa ^acgcjttc atgclggggc cggggcagtg ^ 
cccaccac .gggaggccg aggcgggtgg atcatgaggt caggaga{cg agaccaKct ^ 
ggc^ag gxga.acccc pcteaca aaaatacaaa ^ gccg ggtgcggtgg ^ 
cgggog^g^cocagc^cggg.g^^^^^^ 2634o 
aagcggagc, .gcag.gagc cgagattgcg eetcvast iatmt ^ 

cagagcga ga Ccopcca aaaaaaaaaa a.aaagBca .aaaaaaaa. .attagtcg 264S0 
ggtgtggta. cacgcgcca .aaKteac, ctgaggcgga gaattgc% ^ 

aacccaggag gfcg^ ^ ^ 

gaaamgag agtgtcttcc agggcaaca ^ 
aaaattagoc cggca.gg.g gcatatccct ^ 
gcaggaoao c^gg aggcagaggt tgaagtgagc ^ 
ecaso^ggtg^j^^^^^^^^ ^ 
taKcacca. gg aaggtgag actgacctgt gagtgangt tcaaagaaca ^ 27wo 
cccagagata agacaaaagg g^cfcca, gggggtgtga m ^ ^ 
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cttcttcccc ctcccctctc accccgtggt ttgctaaagg agatgggaaa aaggattctt 27120 
tttttggctg aaatatttaa cactaaatta aagccaattt taacagcact ttggttgatg 27180 
agtgaaatta acagactggc caaaaataaa cgaacggtct gtactatgtg aaaaagaggc 27240 
agctttggcc atgctgggcc aatgtgagtt ttcagggttg ctgggaatgt ctgtgaatcg 27300 
gaggaagggc ctagctggga ctctcaggag ccaaggccct gaggggcaac ttgcctggtc 27360 
cctgccctga ggcgttcact gctttcttcc tgggccagat cacaggcccg gaggctggac 27420 
cactgggctg gcactcttgc cgagctgctc cctgacttcc tgaccatgct cctttcagca 27480 
gccttgctgc actttagttt ccttgaatga aaaatgggga tgagaatagc tcctacctcc 27540 
aaggtgaatg gagtgagttc ggacaggtga ctccctggga ccagtgcctg gcgcctgaca 27600 
aggtccagtc agagcccgca ctgctgttac tgataccctt ggctgtacca ggggagaact 27660 
tggttgccat tgccaggtgt tctcccacca cccccactac tgtccctgtt tgatgtgtgg 27720 
cgggaataaa gctgtgcaca ttggagcttt tggcacatcc tggctttcag gtgaaaggtg 27780 
cgtgtgtgtt tgagggttta gcctggccaa cccagccatg aggtcggacc tgacctgggg 27840 
gtgagtcctg agctcggcac ccctgagctg tgtggctcac ggcagcattc attgtgtggc 27900 
ttgggccgca cccctttccc tgctgggctg ttgatgttta gactggagcc tctgtgttcg 27960 
cttccaggaa ccaacccgtg tgcggacagg aacggggggt gcagccacct gtgcttctgc 28020 
acaccccacg caacccggtg tggctgcccc atcggcctgg agctgctgag tgacatgaag 28080 
acctgcatcg tgcctgaggc cttcttggtc ttcaccagca gagccgccat ccacaggatc 28140 
tccctcgaga ccaataacaa cgacgtggcc atcccgctca cgggcgtcaa ggaggcctca 28200 
gccctggact ttgatgtgtc caacaaccac atctactgga cagacgtcag cctgaaggta 28260 
gcgtgggcca gaacgtgcac acaggcagcc tttatgggaa aaccttgcct ctgttcctgc 28320 
ctcaaaggct tcagacactt ttcttaaagc actatcgtat ttattgtaac gcagttcaag 28380 
ctaatcaaat atgagcaagc ctatttaaaa aaaaaaaaga tgattataat gagcaagtcc 28440 
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agcctattct aagmgta ttacatagc, ttaaaagatt tmatgac, naagtcaca 29880 
agggttctt.gtagaaaaaaa^taggaaag.a.aaaaagaaag.aaaaang, 29940 

cca t aacc t c t ccagccagagac g acc gt ,gagacacc,cagca Mg cc«aa gt c 30000 
«™c,c t aagatagca, actcca, cacagtcata .gctacgcag aattagta, 30060 
cctgamtt tcacttgaca ttacaacagg tatttgatgg cgcgtgaoa aactctttgg 30120 
cacaatctt, fcaa.gfc.g aaatactcca Cgcacaga. gtttgam aggcttaact 3mQ 
fcmt tttg cgtgt g ccgggcacag 3o24<) 

aacacmga ga^gagg caggaggalc acttgagccc ^ 

^aca«a g , g a g acc rat c,c t acaaaaaac tttttt aa t aag t cg ggcgt a g « ggt 30360 
Scatagctg, agtccoagcc accaaggagg ctgagttggg a ggattgctt gagccccagg ^ 

ag^a.gct^acctgaganacccacgtac.caa.cgagcgacagagca 304SO 

ag^ttgtctggggaaaaaaaaaaaaaaaaa.a.a.a.ata.a.aea.a .areata 30540 ' 

ta.aca.acacgcacacacaca.aa^.aaaaam.a.atnataaa^a.aa.a^ 30600 
^ to aaa t a ttt a ttt a t aaa b aaa ttt a t aaa«a t a«, aa a gtaaatatataa 30660 

ta t a«aa £ a t aaaaa at a t a tt a t a t a atatat a ill aaaa tetela a a «aaaaa at a 30720 
tama.aaataa.a.ataataca.act.a.aag^tatttaaaa^a.g.aa.g.a 30780 
tammaa tgMg a Kt ataatataca mafcaata cacatttata Kam^a 3 0840 

& aaa tot a t a t aaaa tett caag t tg cttttt ccaaaaa g g t g,c ttgctgcattKaa 30900 
acatteam atgctgg(ga ^ ^ ^ 

ccagtggcca agoa^gg gagatgttta ^ 
ggccacgoc <„ gcacMggg aggctgaggc actgaggtef 3io8o 

ggamcgagaccagccsccagcgg.gaaaccccatc.c^taa^caaaaa 3U40 
«ag gcg^ atgtgcctgt ^ 
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ggtagacaca cataagggct tttgtgaaat gcttgtgtga atgtgaaata tttgttgtcc 28500 
gttgagcttg acttcagaca ccccacccac tcccttgtcg gtgcccgttt gctcagcaga 28560 
ctctttcttc atttatagtg caaatgtaaa catccaggac aaatacagga agactttttt 28620 
tttttttttt tgagacagag tcttactctg ttgcccaggc tggagtaccg tagcgtgagc 28680 
tcagctcact gcaacctccg cctcccaggt tcaagcgatt cttctgcctc agcctcctga 28740 
gtagctggga ctacagacat gcaccaccac acccagctaa ttttttttat atttttagta 28800 
gagacagggt ttcatcatgt tggccaggct ggtcttgaac tcctgacctc aggggaacag 28860 
acggggttgg cctcccaaag ggcggaaata acaggggtga gccaccgttc ccggcctagg 28920 
aaaacttttt gccttctaaa gaagagttta gcaaactagt ctgtgggctg gccttctgat 28980 
tctgtaaaga aagtttgatt ggtggctggg tgcggtggct cacacctgta atcccagcac 29040 
tttgggaggc cgaggtgggc agatcacctg aggtcgggag ttcgagacca gcctcaccaa 29100 
cgtggagaaa ccccgtctct actaaaaata caaaaaaaaa attaaccggg catggcggcg 29160 
cctgcctgta atcgcagcta ctcaggaggc tgaagcagga gaattgcttg aacctgggag 29220 
gcggaggttg tggtgagctg agatggcacc attgcactcc agcctgggca acaaaagtga 29280 
aactccgtct cagaaaaaaa aaagtttgat tggtgtaacc aaagcgcatt tgtttatgga 29340 
ttgtctgtgg cagcttttgt tctgccgaga tgagttgtga cagatctgta tgggctctaa 29400 
agcctaaaac atgtgccatc cgccccttta cagaaaaagt gtgctgacct ctgttctaaa 29460 
gtattggaca actacaatgt ttgctcattt attattctat gatttgtttt ctgctttttg 29520 
ttgttgttgt tgttgttgag atagggtttc cctctgtcac tcaggctgga gtgcagtggt 29580 
gtaatttcag ctcactgcag cctcgacctc ctgggctcta gtgatcctct catctcagcc 29640 
tccctagtag ctgggactac aggcacacac caccactcct ggctgatttt tttttttttt 29700 
tttttttttt gtggagacag ggtttccgca tgttgcccag gctggtttca aactcctagg 29760 
ctcaaacacc cacctcagcc tcccaaagtg ctgggattac aggcgtgagc caccatgccc 29820 
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agaategctt gaacccaggg ggcagaggtt gcagtgagcc ^ 
cag g c« gg gc aa ga aga gcg tcaaaaaMa ^ 

aaaa,g, ggc ecmtcctc c<cacc« g c t g cca g acca, ca g cc g cgcc tt ca«gaacg 31380 

ssa g «c ggt ggagcacstg gccttgac(a ^ ^ ^ 

ac.gga.ggg caagaaccc «ac. gg gcc g acac tg g g ac caaca g aa,c 31 500 

o«u»« p»« caagtcccg ,g tggaggga ccgaggfcgc ^ 

'ggccctgga .cccaccaag ggg^ ttgcctgtcc cgtgcgtcct ^ 
<W ca^ggg caggctg(cc g(g[ggcM 3ieso 

■wi"**^^^^ 3174o 

tc agaggtca tggagttcct tcgtggagcg e<*zctfrao<r ^ ♦ 

& b&^S ggtgctgagg ctgtatcagg cacagtgctg 31800 

g c tg c ttt cacc.g g g C cg«c to a C cgaag t g tcC a tgg ag Cctgcgtagggtgggtatc 3Jm [ 
**** facagatgc agaaacaggc .cagagaaac cg agtgactt Kcmaggtc ^ 
acatacccag nagagcaga g ctgggccag gaag[gctgt ^ 

^gcmgcactc^caaaaccatg^^^^^^ ^ 

"WW ccgaaa tgg c Ccttggagg c^agcc ac ag Ca gg acccacccg 32I 00 
agaggcaaat agccaggcg ^ 

ca^gcccca,^^^^^^^^ ^ 

ggacaca^cteaccccaaagcaccgtga^^^^^ ^ 
«**«* tgcta^aa ggaa ggaata cttcagga(g ^ ^ ^ 

cca ggtt » ga goaagg gc aggtcaaac, tagaaaattc t ggaa tg a g g at^catt, 32400 

.cgoaaa agaagaggga aggagg g gct gctgggggag ^ 
cc^^cc^^^^ 3252q 
gcKgtg.cc a.00^ ^ caagctt agmatgt 3238o 
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agtcccagct acttgggagg ctgaggcagg agaatggcgt gaacccggga ggcggagctt 42300 
gcggtgagcc gagatcgctt cactgcactc gagcctgggc aacagagcaa gactccgtct 42360 
cacgcaaaac tctgtctcac gcaagactcc gtctcaaaaa aaaaaagagt tcagggttta 42420 
tgaaactggc cagccgcgta aagtttgctg tgttgttttt gtgcccggga ggagtgtggc 42480 
cagggtgtca cgtcacacag tacacgtttc tcagatggtg gttctccaga ctgctgtccc 42540 
aaagtctgtt tttgcatctg gttcccacag acccaccctc cacggtgagc ctgattttgg 42600 
ccagggtagc tggaatcttg cttgtctttc agcccggcag ctgtaccagt ccagggtcca 42660 
cagctagtgg cttttaggaa ggaatttgtt cagttggctt tgacacatgg ccccctaggg 42720 
tccacagctc tgtagtgatg tggatgttgt tatctacaaa gacacatgat ccttcgtgtc 42780 
cagatgaaag tgatgatgtc tttgcagctg cccagcaagg ctgtgtgtgt gtgtgtgtgt 42840 
gtgtgtgtgt gtgtgtgtgg tgtgtgtgtg gtgtgtgtgt gtgtatgggg gagggaggca 42900 
ccctttccat ctgggggtgt gtgtgtgtgg ggtgtgtgtg tgtgtgtgcg cgtgtgtgtg 42960 
gtgtgtggtg tgtgtgtgtg tatgggggag gcaccctttc catctgggtc caagagactg 43020 
ggcctgggga agacgcttct ttttatctac ttagagactt tgttttattt gtattttttt 43080 
gagacagggt ctcactctgt cacccaggct ggggtatggt gatatgagca tagctcactg 43140 
cagcctcggc ctcccaggct gaagcgatcc tcccacctca gccttctgaa tagctgggac 43200 
tgtaggcgtg cgtcaccata ctgagctatt gttttttttg tttggttggt ttaatttttt 43260 
ttgatacaga tggagtcttg ctatgttgcc cagactagtc tcaaactcct gaactcaagt 43320 
gattctccca cctcagtttc ccgacattct gggatcacag gtgtgagcca ctgctgtctc 43380 
cctgttttat taactgctga aagacctaga taaagaaagt ctgaaaagac ttactatcag 43440 
agcaccatcc taagatgatt ccctctgact caatggagag ggaggggagc ttttccttca 43500 
ggcctgggtg gcaggagccc aggtgctcca ggccccattt gccccaggcc aaatcactcg 43560 
ggaacttgga tgcagctgtc tttcagggta acccaaagga acpagatccc cgcaggcagt 43620 
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aggcttctgg gctgtcctct cctcctacgt caaetaw*. . 

gt cagctcagta agagcccttc gaagggatgc 43680 

tgagagtcac tgagagacgt ggctecccta ^ 

gggctgccctgccttcccaagaggg g ct ctga g tcattccc ^ 

cccccactca cccgcccctg caccctcacc tgtcccccac m 

c igtcccccac cttcacctaa cccccaccct 44040 

gcttgagtcc tggaaggtgg aggagacagg gatgt£ttjrjT 

gggatgtgttg ggaagggccccatggtcttgg ^ 22o 

atcccttctcgactgtcaatggg gccttcat 

sss^gcgcc agtctagtga tgcacagctg 44280 
ggtgcccggc gggtggctg* ggasgccfco 

ggggCC£aaagtCC ^ c ggcaagagc tctt ccagagg 44340 
ctgttgtcct aatcgctctg gcatactcag gcg ggcaC£r t ^ 

g g gggcacgt agttaggagc tgattggaga 44400 

ggagagaccc ccacaccaat ar ta<, M « 

uaai act gggattt gactttcage cta^m^ ^ 

a °S ctaa acttgagaagtgtg gc 444^ 

^^^^^ 

gccgtgtctt ctgccgaatg ccagcctctc aca ggc t ggg ^ 

^ cc gSg gagactttcc accctgggga 44940 

tccaatgggt ggctttccag ggtcccaaaa 

gg.tcccaaaagcaaacacaggctctttcacagcccctcca 45000 
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ggaaagcaga aagccccaag ggctggaagg gaagggggag ctctgctgag aggttacaag 45060 
gcagcgctgg ccgacgggag ttgcagttga taggttttgt atcatccttg ttaaacttga 45120 
accctgtgca gaaatccctt ccacggcatg ggggctgcct gttgactcgc tcctgttcca 45180 
ccacagggag ctcctgggct tcttcctccc agaggccccc gacgctccca cctgttggtc 45240 
gtcagagctt ctggttggtg ggaaggcacc caggaccttg aggtctccag agagaaaagc 45300 
cagggaaaga gggagaccga aacccatgtg acatgaaact caggctccaa actgagcacg 45360 
ggaacgtttg gggacaggag cgcgatggcc ttcctcagat agctgggggg ctggcatgaa 45420 
gacgggagct acagccagca caggtcctgg gccgggagec cagagattga gccctgactc 45480 
tgtcacttac tggccacgtg accttgggcg ggtggcatag cctcttggag actcagtttc 45540 
ctcattggta ggagtgacgg ccacagtggt gcggcctctg cagcacacgg ggggctcggt 45600 
gggcggaagc cccgggtcta taaggcggct gtgcaggagc cagccgagct ggtctcccaa 45660 
cagccagggc tccggggtcc ttagcagctg tggggggcct gcacctgttt cccatggctg 45720 
ctgtcagaaa ttaccagaag ccaggtggct gagagtaatg gacacttgtt ctctcacagt 45780 
tcctgagggc tgaagcccga gatcgaggtg tgggcagggc cctgcgccct ctgaaggctc 45840 
tgagggaacc tttgggcttc tggtggctcc aggcacccct tgacttgtgg tcctgtcact 45900 
ccagtctctc tgtctggctg cacatggcgt ggcctcttct gtaccattga aggacacttc 45960 
agttggattt agggcctacc ctcacccatt gtggtcgtat cttgatcctt catgacattt 46020 
gtaaagaccc tgcttccaaa taagctcaca ttctgaggtt ctggggtgag cgggaatttg 46080 
gagagcattg ttcaactagt atagaatgtg acctgtcagc ctcgggcagc cctgagaggc 46140 
aggggctttc cacagcccag ctgggtgccc tgggctccgt gctgtccgag gagacgccat 46200 
ccccacaccc gtccttcacc cgccaccctc ccgcaggtac ctgtacttca ccaacatgca 46260 
ggaccgggca gccaagatcg aacgcgcagc cctggacggc accgagcgcg aggtcctctt 46320 
caccaccggc ctcatccgcc ctgtggccct ggtggtggac aacacactgg gcaagctgtt 46380 

BNSDOCID: <WO 0l92B91A2J_> 



WO 01/92891 



144 PCT/US01/16946 



ctgggtggac gcggacctga agcgcattga gagctgtgac ctgtcaggta cgcgccccgg 46440 
ggcctgccct aaccgcagac acccggcctt cattgtcagt aatggcagca gctgccacat 46500 
tgtccgagac ctgccgtgag cccagtgccg cgccaggggc tttgtgtgta gcgtgttttg 46560 
tcctcacact gacagctgta ggctggggtt ctgagtgagc cccacagggc agaggcagaa 46620 
aatgagtctc agagagggtg agcgagctgc ttggggcccc acagcaggag atggagcagg 46680 
actgcagcct agcctctgcc cccagcacct gcgcaagaag ctgctctgct ctggactgtg 46740 
ttaggctgcg agggctggag agaaatgaga gttggtgctt agagaggggg cgcaggtccc 46800 
catggctttt cctcttatga tgaggtagat gggtgaaggg aggggccatg cttgcagggg 46860 
ccagtgaccg aggcccgccg ttggaactga tggccttcat cccgagccca gcccaggtgg 46920 
gagcagggct ttccgagggc ttgtcttggg tcggcctgct tccagggact ctgctgcagc 46980 
tcccacccct gtccaaagca tggaatcccc caggctccct ggcagtcctg tcaacctctg 47040 
tcctcccaag ctgagtgtgg ggcaagttct ggaggtcagc actgctcagg ggggcccacg 47100 
ggctgcttgc aggggccaac cgcctgaccc tggaggacgc caacatcgtg cagcctctgg 47160 
gcctgaccat ccttggcaag catctctact ggatcgaccg ccagcagcag atgatcgagc 47220 
gtgtggagaa gaccaccggg gacaagcgga ctcgcatcca gggccgtgtc gcccacctca 47280 
ctggcatcca tgcagtggag gaagtcagcc tggaggagtt ctgtacgtgg gggctggcag 47340 
tggggtgggc agggtggcct ctaaacccga cccctggagg aggctggagg ccagtgcaag 47400 
atcctgtgtg gcctcagcca ggcggtggtc tctgccagat gccaactgtt gcccgctggg 47460 
gttcagcgac atgtccgaat gtcccgaggc ctctgaggtt gmtctm gccgcagaac 47520 
aaatcaccac gaacagcgtt ttaagacaac accaactctt tttttttttt ttttttttga 47580 
gtcaggatct tgctctgttg cccaggctgg ggtgccctgg tgcaaacaca gttcactgca 47640 
gcctcgacct ctgggcttaa ttaagtgaac accttgcctc agcctcccag gtagctggga 47700 
ctacaggtgg gcaccaccac acctggctaa tttttttttg tagagacggg gtttccccat 47760 
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gttgcccagg ctggtctgca actcctgggc acaagctatc tgcctgctgt ggcctcccaa 47820 
agtgctagga ttataggtgt gagccactgg cctgacaaca cccacggatt gtctctcagt 47880 
tctgtaaggc aaagtccagg cacagcgtgg ctcacctggg ttctctgctc agggtctcac 47940 
ggggccagaa tcaaggtgtc aggaacgctg ggccctcagc ggaggctctg tggagaaatt 48000 
agcttccttg ctcactcagc aggtagcagt tgtgggatcg aggttctgtt ttctctctgg 48060 
ttattggtcg gggaccactc tcagctccta gaggccaccc caggtccttg ccccgtggcc 48120 
ctctctgcct cagcagtggg ggctccctgc gtcagtccct cccgcacctt gagtctctct 48180 
gatttgcttc taaagggccc tgtgattcgg ctcagccacc tttagattag gttagcctcc 48240 
cctttgatag actccaagtc ggctgattaa taaccttact cacatctgca gaatcccttc 48300 
tgccacataa ggtcatgacg ccgtgctggg gactggggtg ggaaattacg gggtcattta 48360 
ggattctgcc tgccactgcc ttgctgtgtc ccagggcttg ggggaggggc ctccacagct 48420 
gggaccacag tccttcctcc cctccatggt aaccatctga ggattacttg agaccagcct 48480 
gggcaacatg gtgagaaccc atccctacaa aaaatacaaa caaaaaggga ccaggctggg 48540 
cttggtggct catgcctata atcccagcac tttgggagac caaggtgggc tgatcacttg 48600 
aggttgggag ttcgagacca gcctgcccaa catagtgaaa tcccgtctct actaaaaata 48660 
caaaaattag ctgggtgtgg tggcaggcgc ctgtattccc agctactggg gaggctgagg 48720 
tgggagaatt acttgaacct gggaggcgga agttgcagtg agccaaaatt acgccactgc 48780 
actccagcct aggcaataga gtgagactcc gtctcaaaaa aaaaaaaggg ccaggggtgg 48840 
tagtgacaaa gagaccctat cccaaaaaaa ccgaacactg aatccttgag actgagtaag 48900 
gacactgtga aatttttctg ggtggggcag ggaacagagc gtcttctgtc atttcttcca 48960 
cctgggtgtg gtcagctctc cctccaagct gcctcctctt cttctcattg tccgggtgtt 49020 
ggacacattt ggttaactgg atagaataac gcgagttccc agggacttgg tccatttgct 49080 
attttatttt attttatttt attttatttt atttatttat ttatttattt atttatttat 49140 
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tgagatggag tttcgttttt gtcgcccagg ctggagtgca gtggcgcgat ctcggttcac 49200 
tgcaacctct gcctcccagg ttcaagtgat tctcctacct cagccttcca agtaactggg 49260 
attacaggca cccaccacca taccaggcta atttttttgt atttttagta gagacgggtt 49320 
ttcgccattt tgcccaggct ggtcttcaac tcctagcctc aggtgatcca cgcacctcgg 49380 
cctcccaaag tgctgggatt acaggcatga gccaccacgc ctggcaccat ttgctatttt 49440 
aattcccatg tgtattagtg tcccacggct gctgtaacaa atgaccacaa actggatggc 49500 
ttaaagcaac agaaatggat tcccccaatg tgctggagac cagaagcctg cgaccaaact 49560 
gttgggaggg ctgtgcttcc tctgggggct ccagggagga tctatttgtt ggcccttcca 49620 
gtgctgtggg tgccagcgtt ccacacttgt ggatgcgccg cctcaacctc tgcccatctt 49680 
catgtgtcca tctcctttgt gtctgcgtct ttacctcttc ttcttgtctg tgttgcctct 49740 
tataaggacg tttgtcattg ggtttagggc ccacccaaat catccgagat gacctcgtct 49800 
tgagatcctt aacctgcaaa gacccttttt ccaaaaaaag gttatgctca cagattctag 49860 
gccttaagac atgggtgtat ctttctgggg ggcactatcc aaccccttat acaatgaaag 49920 
acgggaagag ggccaggtgt ggtagttcac gcctgtaatc tcagcacttt aggaagctga 49980 
agcgggagga tcacttgagc ccaggagttt acaagtagct aggcaacatg atgagacccc 50040 
atttctacaa aaagtaaaaa aaaaaaaaaa aaaaaaaaag ccaggtgtgg tggctcacac 50100 
ctgtaatccc agcactttgg gaggctgagg caggcagatc acgaggtcag gagattgaga 50160 
ccatcctggc taacacggtg aaaccccgtc tctactaaaa atacaaaaaa ttatggccgg 50220 
gcgcagtggc tcccgcctgt aatcccagca ctttgggagg ccgaggtggg tgaattacaa 50280 
ggtoaagaga tcgagaccat cttggctaac acggtgaaac cccatcaaga tcacaaggtc 50340 
aagagatgga gaccatcctg gctaacacgg tgaaaccccg tctctactaa aaatacaaaa 50400 
aattagccgg gcatggtagc gggcgcctgt agtcccagct gctcgggagg ctgaggcagg 50460 
agaatggcgt gaacccggga ggcggagctt gcggtgagcc gJga tcgctc catgccattg 50520 
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atgggacgtg ctgacaggtc ctctgccggg ttcctgcctt gctatgcgca cgctggtcac 54720 
cacagaggcc tggcccttct tctgtagcag tcccacaccc gcaacaggtg tggctgctga 54780 
ccacctgctt tctgcccctc tggtcctgag gagggcgcag tgggcactca ggcgtggctg 54840 
agcagatgtg tgttgccggg aggaggaagg actgctccag tcagggctga atttcccacc 54900 
cggagcattt ctgctgtatt tggtgtagcg dctgctgctt aaagctctga ttcccagttg 54960 
gcaccctttc ccttctgcat tgaaaaacat acggatgcat gtcttcttgc agtgaatgtg 55020 
tattctccca gcctctcttc tgggttgggg ctggaggtgg agcggcacac aggagccgca 55080 
gcgatggagg atgtgcgggt gcagcacccc gtacagcagg gatgccaaac ccgcgctgag 55140 
tccctctcaa cttctgcttt gaagcccagt cacgccattg cctgggtttt gctgggcggg 55200 
gctgcatgtg atgttctcct ctgtccctcc cccagagccg cccacctgct ccccggacca 55260 
gtttgcatgt gccacagggg agatcgactg tatccccggg gcctggcgct gtgacggctt 55320 
tcccgagtgc gatgaccaga gcgacgagga gggctgcccc gtgtgctccg ccgcccagtt 55380 
cccctgcgcg cggggtcagt gtgtggacct gcgcctgcgc tgcgacggcg aggcagactg 55440 
tcaggaccgc tcagacgagg tggactgtga cggtgaggcc ctccccgtca aggctctgcc 55500 
aagaccctgg ccctgccctc cgggatacga gcttggggct gcctccggcc tcacaggagt 55560 
aggggctctg aaaacctttg cttgcaggga gattgccaag tctgtctttt aggcccaaca 55620 
aggaaaactc tgcagttcca cccatcctgt cccaccaggt agtgtggctt gaaggcagac 55680 
tgtgagggtc tatctcacct tcctgcatta ggtcaggagt ttcacagaaa cctgaggcac 55740 
attcaggggt gggctgcaga ggtccatggc tcacaccctg gaaaatccgc ccccaaaaga 55800 
cagtgctgtc tccactgacc agtctgtggg atagtgctta agcctgagtg gtttctatca 55860 
acatgtagaa tcaggaggta taaagagatt tgctcaggca tcctgggccc tctctgacca 55920 
gcaggatctt cctttagatc ttgacagtga aacacatctc ttctgtgccc cctgtgagtt 55980 
ttctttcatt cattcattca ttcattcatt cattcattca ttcgagacag^ agtcttgctc 56040 
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cactccagcc tgggtgacag agtgagactc cgtctcaaaa aaaaaaaaaa aaagaaaatt 50580 
agccaggcac agtggcaggt gcctattgtc ccagctactt gggaggctaa ggcaggagaa 50640 
tggcatgaac ccgggaggtg gagtttgcag tgagccgaga tcatgccact gcgctccagc 50700 
ctgggcgata gagcaagact ctgtctcaaa aaaaaaagcc aggcatggtg gtgcatgcct 50760 
gtagtcccag ctactcaaga ggctgaggca ggagggttgt tcgacccacg gagatcaagg 50820 
ctacagtgag ccatgatcgc accactgccc tccagcctgg gtgacagagt gtgaccctgt 50880 
ctcaaagtaa gtaaatagga ggagagacaa gtgggcagtt cagactgatg gtatgggcac 50940 
agtagagact ggtgcagaca ggctggcctg tgatgtcaag caacttctgt aactgtttcc 51000 
ggcatccatt tgtgtgtcaa tttccgtgtc agtaggaaga ctctgtaggc tgccaagagg 51060 
aataagtggg aggatcctcc cagagaggcc gggcctgcag gagggccagt tctcatgagt 51120 
tcttatttgg cccctaccct ccaggctgtg gttctgaggt gggagacaga gcctgacctc 5 1 180 
tgtttgtctt gttttgtctt tgcagcagcc cacccatgtg cccgtgacaa tggtggctgc 51240 
tcccacatct gtattgccaa gggtgatggg acaccacggt gctcatgccc agtccacctc 51300 
gtgctcctgc agaacctgct gacctgtgga ggtaggtgtg acctaggtgc tcctttgggg 5 1 360 
tgatggacag gtacctgatt ctctgcctgc taggctgctg cctggcatcc ttttaaaatc 51420 
acagtccctg tggcatccag tttccaaagc tgattgtgtc ttcctttgcc ctcctttctt 51480 
ttctactatg tgcattcggt gctatgaatt ttcxtctaag tactgcgttt cctgcatctc 51540 
acaaattttg ttacattttc attttcaggt agtttgaata tttttacact tctcctgaga 5 1600 
tgacatcttt ggctcatgtg ttatttagaa gtgttgctta gtttctaaag agttggggct 51660 
tttccagctg tctctctgca actgatttct aatttaattc tactgtagtc tgagagctta 51720 
ttttatatga tttctgttat tttaaatgtg ttgggtgtgg tgtttttgtt gttattgttt 5 1780 
ttgtgtcttt ttgttttgtt ttgcttcgtt tgttttgttt ttgagacagt gtcttgctct 5 1 840 
gtcactcagg ctggagtgca atggcgcgat ctcagctcac cgcaacctct gcctcccggg 5 1900 
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tgtcacccag gctggagtgc cctggtgtaa tctcggctca ctgcaacctc tgcctccagg 56100 
gttcaatcga ttctcctgcc tcagcctccc gagtagctgg gatgacaggt gcgcaccacc 56160 
atgcctggct aatttttgta tttttagtag agacagggtt tcaccatgtt ggccaggctg 56220 
gtctcgaact cctgacctca ggtgatccgc ccgcctcagc ctcccaaagt gctgggatta 56280 
caggcatgag ccaccgcgcc cggcctgagt tttcctttta tgaaggacct gcttggttgg 56340 
ttgcctgcca catgttgtca gcaccatggg cccaggactg ctgaggagct gttgatgccc 56400 
tcgctctccc agagccaccg gctctgttag ataattcaca tgcagtctgg ccactgtcct 56460 
acgtcctcat tcacaaagag cagacatttc gtagaagatg agggcctggg agtaacctcc 56520 
ctgcatgttt ttctataaag gcatagtggt taagtccttc cagctcattg accattggag 56580 
aattttatgg aggctgtaga ctaggggctg gtaaactaag ggcccagggg ccaaatccag 56640 
cctgccacct acttttgtaa ataaagtttt cttggtgcac agccatgccc attcattcat 56700 
ttgcacaatg tctgtggctg ctttcatgcc aaaagcagga gaactgagtg gttatgctgg 56760 
agacctacgg ccttcaaagc cccagacctc acgtctggcc cttgacagac agagcttccc 56820 
cagccctgct gcgcatcctg gcccagcatg tgctgtgtgt gtgatttcag cttgcaggag 56880 
ccgtggttag gaattgtccc tgtgttggtc cattttgcat tgctatgaag gagcacctga 56940 
ggccgggtag attatgaagg aaagaggtct gtctggctca tggttctgta ggcagcacca 57000 
gtatggcacc cgcatctgct cagcttctag tgaggtctca ggaagctttg actcatggtg 57060 
gaagtcgaag cgggagcagg tgcatcacat ggtgagagag ggagcaacgg agagagagag 57120 
agagagagag agagcgcctc tccctcttgc cctcaccttg agaggagatg ccaggctcct 57180 
ttaagtaacc agctcccatg tgaactcaca gtgagagccc atttgctact gcggagaggg 57240 
caccaggcat ctgctcccat gacccaaaca ctgcccacca ggccctacct ccaaccttgg 57300 
ggtcatattt tattctgttc tatgctatgc tatgctatgc catgccatgc catgccatgc 57360 
txtcam ctattamg agacagaatc tcgctctgtt gcccaggctg g^gtg^gtg 57420 
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gcatgatctt ggctcactgc aacctccacc tcccaggttc aagcgattct cctgcctcag 57480 
cctcccgagt agctgggatt acaggcacac accaccacac ccgggtaatt tttgtatttt 57540 
caatagagat ggggtttcac catgttggcc aggctggtct caaactcctg gcctcaagtg 57600 
atccacttac ctcggcctcc caaagtgcca tgattacaga tgtgagtcac tgcgcccagt 57660 
gagggtcaca tttccgttga gatttggagg ggcagacgtt ggagccatct gagccccctc 57720 
gtcccgctct agcttctcct cccgtgtgcc ccgcggtgct ggtggcaggc ccttacgccg 57780 
gttctggctg cacgctctgt tccagaagct ttcttccctg cttggttacc agaaaatcat 57840 
cccatccatt acaaggacag ggtcccctta tctcccattc ccagggcagg acaccggggg 57900 
cagggcaggt ggggaactga gcaagttctc tgggggcagg cgtggctatg gctccctctg 57960 
ggtgggcgtctggggaggggtggaggcagccgtcagcgccctggcttgctcttcctccct 58020 
ggccagagac tgtggccttg tgctgctccc gtgtgggctg cctgcacctc cagtgggttg 58080 
tgctccctcc cctcccctcc cctcaagctc tgctgagcac cactgccttc cacagccccc 58 140 
actctcggga ggcgaggctc ctcgtggcca ttcctgtcct tggcacccac ccccccacca 58200 
acctggtaga gccttgggcg gggtctgtta ctccttgcat ggcgtagacc tccccacagt 58260 
aggcacctga cacatacctc ctggggggca ggcaggaggt gcgttgaggt ctcagccctg 58320 
gcagtccctc ccctgcgtgg cataggcctc gccacagggt catcgagggt gggtggagac 58380 
tgtactagac cactccccgc tggtcctaga aagggtccca tctgtctgct ctctgtttgg 58440 
agtccagacc ttggttgctg tgccctgcat ggtgggctgg ggggcaccct ccagcctctc 58500 
tgagtgcatg gcctctcctt gcagccatct gcctgcccaa ccagttccgg tgtgcgagcg 58560 
gccagtgtgt cctcatcaaa cagcagtgcg actccttccc cgactgtatc gacggctccg 58620 
acgagctcat gtgtggtgag ccagcttctg gcacggggaa ggggcgtccg ggctgggttc 58680 
ccccaggaac gtggagttta ggggaggaga cgtgcctttc cagcggggct gggggctgtg 58740 
tgggagactc aggcggctgg gaggctcctt gcgggaggca gggaagcctt tcccagggca 58800 
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ggcttatggg tggcgtgaat tagtcggggt ctatcaggag gcagaaactc tatgagaatt 65760 
tgaacagaga aagttccgtc tacaggctta ttaccaggga ctggaatagc agaaattgaa 65820 
cagtgagatg tacagagaac tctaagaatg caggaatagg ccaggcatgg tggctcacac 65880 
ctgtcatccc agcactttgg gagaccaagg cgggtggatc acctgaggtc aggagttcga 65940 
gaccagcctg gccaacatag tgaaacccca tctctactaa aaatacaaaa aaattagctg 66000 
ggtgtggtgg cgcatgcctg taatcccagc ttctcgggag tctgaggctg gagaatcact 66060 
tgaacctggg aggcagaggt tgtagtgagc cgagatcatg ccattgtact ccagcctggg 66120 
caacaagagc gagactcagt caaaacaaca acaacgcagg aatagcagat gagccgaggt 66180 
ggggcctccc cagcccccac cccccacccc gcaccctggg ccgagatcca gtcctctttg 66240 
aatagggcct gggcgtggtt cacgggacat ctgagacatt gccgaggcgc tgcactggtg 66300 
gatcttgcca gaagtctgcc cagtgcagat ttgggcagaa tctcaaactg ccttgggatg 66360 
taggagagaa accaggcctg gtcaagttca tgggaagagg tggaaacaga ccccataggc 66420 
tggggcttgg gcagctgtag gaagccctct ctgctgcctc cctgcctgct ctctgctttg 66480 
aagcatcttc cccagtgccc ccagtctcat gccctctcaa cgttggggtc aaatcctgag 66540 
gaatacccag actggctctc tgggccaaag aggaccctct ccagaaagag cagggcccag 66600 
tgcggcttcc taaagggcag gggaagggcc tggccactcc ccagaggcta ctcaccagcc 66660 
atcaggatag ccccaggaag caggccttct cgagcccatt ttattacttt attttattat 66720 
tttatttaat tttaaattta ttttttgaga cagagtctca ctctgttgcc caggctggag 66780 
tgcagtggtg cgatctcaac ccactgcagc ctctgcctcc agggttcaag ggattctccc 66840 
acctcagcct cccaagtagc tgggattaca ggtgcccgcc accacacccg gctaattttc 66900 
atatttttag tagagacgag gtttcaccat gttggccagg ctggtctcga actcctgacc 66960 
tcaagtgatc cgcccgcctc ggcctcccaa agtgctaggt caagcccatt ttaaagttga 67020 
agaaactgag gctgaggtaa attccctccc cagggatcct gctgcagcca gaaggtggta 67080 
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ggctcccagc tgccgtgtgc accctgccta gagctctacc gtaacccatc tccgggagga 64380 
ggtgctattg ttttcctcat tttgcaacaa ggaggctgaa gaactgagca tgaaccactg 64440 
gcctgggtcg ttcggttggt aggcagtggg gccaggccat ccaactcaca accaccttct 64500 
actctgcttc ccccgcaccc tgaagtttgt tctgttttga ggacacagcc gtcacattct 64560 
tggtggctga acagcactcc ttgtcaggtg tggctgggcc cccactggag ggcatcatgg 64620 
tcctctctcc tgctgcggtt gaaccttggc tgtttcaacc actcctgcca agtggccctc 64680 
tgaaagggac agtccatctt ttctcagcag agggccacac tggcaaaacg gtccctggca 64740 
ccctttctct ccacctgtct aatatagagt aaaaatggta tcatgttaag atcttcattt 64800 
atatttattt tatcatgaat gatgtaagca tcattttgtg tgtttaagaa cctttgggcc 64860 
cagcgtgatg gcttgcagct gtaatctcag cactttagga ggctgagatg agcggatcac 64920 
ttgaggccgg gagtttgaga ccagcctggc caacatggag aaaccccgtc tctagtaaaa 64980 
atttaaaaat tagccgggta tggtgatccc agctacttgg gagtctgaag catgagaatt 65040 
gcttgaacat gggaggcgga ggttgcagtg agccgagatc gcgccattgc actccagcct 65100 

gggcgacaga gcgagactct gtctcacaaa aaaaaaaaaa aaagaaaaga aaagaaatta 65160 

tcaatctcct cttttatggc atatatatat atatatatat atatatatat ttatttccct 65220 

ttcttggtta tgttcataaa ggcctcccct gctctgatca taaaaaacaa cttattttca 65280 

cactctctct cttttttttt tgagacagag ttttgctcct gttgcccagg ctggagtgca 65340 

gtggcgcaat ctcagctcac tgtaacctcc gcctcccggg ttggagtgat tctcctgcct 65400 

taccttcccg agtagctggg attataggca tgcaccacca tgcctggcta attttgtact 65460 

tttagtagag acgggggttt ctccatgttg gtcaggctgg tctcgaactc gcgacctcag 65520 

gtgatccacc cacctcggcc tcccaaagtg ctgggattac agacgtgagc caccatgc'cc 65580 

agcccacact ctctttctta acgtcctcct cctttcgttt tacgttcaca tctttaattc 65640 

ttctgggatg taattagatt tgatgagcaa ggtgggcatc cagcttgttt cttggctgat 65700 
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cggggccggg gaggggcggg gcgggatggg gctgtgggcc cctcccaccg tcagtgctgg 61620 
ccaccggagg cttcccgggt tcctgggggc tgtgccaccg cctctgaggc atgcttgctt 61680 
tcttcccttt tcaaaccctt ctgcttcctt ctttaatgac attgttgatt gtggataatc 61740 
tgaaaactac acaaaaatat aaagagccaa aatctcaccc aaatccacct cctagagtgg 61800 
ct gttgggct ccgtcagcat ccaggcggcc gtctgtgtte cgcacggccc agcccatcga 61860 
tagccgcctg caccaggcct gtctgccctc tgtgagcctc cccacagggt tccctccaca 61920 
aacaccctgt tctcccaccc agggctggct gcttcctgga aaacagctgg atggttttgt 61980 
gcatgacaga caaacacagg gtgattttcg tggctaaaat actccctgga gcttttggca 62040 
gggtgagggg ctggctccag ctgagccacg ccttgagtga aatgactgtg aggagaataa 62 100 
actgccgctg ccctccagga tcactggggc tggctgggga gaacccccgt ttctgggagc 62160 
acagtcccag gatgccaagg cgagcttggt gccgagatgt gaactcctga gtgtaaacag 62220 
cgggggctga cttgacatgc tttgtatgct ttteatttgt tcctgcagct gtatgcccct 62280 
aaggtgagtc cagccccctt ctgcttcctc tggggcctcg ccagtgagcc ccaccttgct 62340 
ggggctggtt cctcctgccc ttctgggtat ccctcacatc tggggtcttg tcttcttgtt 62400 
ttatttttct LLLiLLLLLg agacggagtt tcacttttgt tgcccaggct tcagtgcaat 62460 
ggtgtgatct ctaggctcac cgcaacctct gcctcccagg ttcaagcagt tctcctgcct 62520 
cagcctccct agtagctggg attacaggca tgtgccacca cgcccagcta attttgtatt 62580 
tttagtagag atggggtttc tccatgttgg tcaggctgat cttgaactcc ctacctcagg 62640 
tgatccgccc accttggcct cccaaagtgc tgggattaca ggcgtgagcc accgcacctg 62700 
gcctttttct tttcttttct tttctttttt ctgagacagg gtctcgctct gtcacccagg 62760 
ctggagtgca atggtgtcat catggctaac tgcagcctct accttctagg ctcaagcaat 62820 
cctcccatct cagcccctaa gtagctagga ctgcacgcat gcatccccat gcccagctaa 62880 
tatttacatt ttttgtagag atgaagtttc actatattgc ccaggctgg_t ctccaactcc 62940 
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aaacaggact tcacccgggt ctgtctggcg tgaaaggcag tgflcttgta ccaccctagg 67 140 
gggcctgaga gaactgagtc cctcgggcat aactgacagt tctgttccca ttattccgca 67200 
ggggctcgga tctggctgta tgctttccag gatggccttg gagacccaca taagccctac 67260 
accctttggg aagctgcatg ttgggttggg gtgccgtcag tggcacttgt ggaaggtgca 67320 
gacctgtgtg ggtgtgtggg cccagggccc ctggtccctt cctccctttg tagggctggt 67380 
tgtgtgctgc ctggacctgg ggggcacgtt cacgtggtga atttgtctat ttactatccc 67440 
cgctttgggg ctggtgccag cacaggccct tgtgaagggg gtgcctttgt ctggagtggg 67500 
actgtggccc ctccctcagc gtggtgactt ctgtgtcagg gcttcagcag ggacgcagag 67560 
cccctgagtg ttcggaacaa gggcgtcatt gcaggagtta gactgtgtgt gatggaggga 67620 
ggaggggcag gaggaaaggt cagaaggaga gttcctggga aggtccctga ggagcctggt 67680 
gaggtgctaa ctggtgtgga ggacactcag ggcctgtggg gacatctcct actgctgggg 67740 
gccagccaca aagggaactg gccgaagtcc tgtccccgcc ttcacagccc agcatctggt 67800 
cacaaggcag gtacttggaa gggcgcgggc acctgggcca aaagtgcctg ggttcccttt 67860 
gcctttcact gagatgacct tcggggcagg tggctgctgc ctcccctcct gtccccaggt 67920 
tttgccaact ggccagagga aggggtcctg ggaagcaggg gggccagaag ccctctctgc 67980 
aaggaaagcc cgaggggtgt gggaggaagg aaggaatgcc caggctggcg aggctctaag 68040 
tcaccctggc ttggctctcc tcagatcctg aacccgccgc cctccccggc cacggacccc 68100 
tccctgtaca acatggacat gttctactct tcaaacattc cggccactgc gagaccgtac 68 160 
aggtaggaca tcccctgcag ccctccatgg ccattgggtt cccgccagcc cgtggtggag 68220 
gggcctaatc cccatgccac tgatgagggg aggtattctg ggtgctagtg ggcaggtgcc 68280 
gggcccagcc ctgcctccct ctgctctgcc aaccacacta ggctgcctcc ccagacaagc 68340 
tcagcgggca ctgcatgttg ggttcagaaa tcagcagaac tccacgttct gagctgctct 68400 
tcaagttgct cctatggggg ttacttttaa gctgggaaat ggc;gtggcg tcgaggggcc 68460 
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ctaattaaag agacggggca gtggacaggc attttcagtt gactgcccag ggagtgttct 180 
gcccaacagg gaggatatgc gtacagaatc atactcgatc agcatgagtc caattcagac 240 ' 
cgtacatcag tggagatatg ggtcccccga tgactccgtg gaacactgat gtttgtgaca 300 
ggggagtaca gcaccagcca tcagcaggcc agtaaatcat accggcctgc gaaattggac 360 
tcagacccgg atccaccctg accgacgtcc caagccccca ccccccaccc cccaccatgg 420 
gccgagatcc agtcctcttt gaatagggcc tggccgtggt tcacgggaca tctgagacat 480 
tgccgaggcg ctgcattggt ggatcttgcc agaagtttgc ccagtgcaga tttgggcaga 540 
atctcaaact gccttgggat gtaggagaga aaccaggcct ggtcaagttc atgggaagag 600 
gtggaaacag accccatagg ctggggcttg ggcagctgta ggaagccctc tctgctgcct 660 
ccctgcctgc tctctgcttt gaagcatctt ccccagtgcc cccagtctca tgccctctca 720 
acgttggggt caaatcctga ggaataccca gactggctct ctgggccaaa gaggaccctc 780 
tccagaaaga gcagggccca gtgcggcttc ctaaagggca ggggaagggc ctggccactc 840 
cccagaggct actcaccagc catcaggata gccccaggaa gcaggccttc tcgagcccat 900 
tttattactt tattttatta ttttatttaa ttttaaattt attttttgag acagagtotc 960 
actctgttgc ccaggctgga gtgcagtggt gcgatctcaa cccactgcag cctctgcctc 1020 
cagggttcaa gggattctcc cacctcagcc tcccaagtag ctgggattac aggtgcccgc 1080 
caccacaccc ggctaatttt catattttta gtagagatga ggtttcacca tgttggccag 1140 
gctggtctcg aactcctgac ctcaagtgat ccgcccgcct cggcctccca aagtgctagg 1200 
tcaagcccat tttaaagttg aagaaactga ggctgaggta aattccctcc ccagggatcc 1260 
tgctgcagcc agaaggtggt aaaacaggac ttcacccggg tctgtctggc gtgaaaggca 1320 
gtgttcugt accaccctag ggggcctgag agaactgagt ccctcgggca taactgacag 1380 
ttctgttccc attattccgc aggggctcgg atctggctgt atgctttcca ggatggcctt 1440 
ggagacccac ataagcccta caccctttgg gaagctgcat gt^gttgg ggtgccgtca 1500 
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ctgtcatcca ggctggagtg cagtggtaca atctcagctc actgcaagct ccgactccca 71280 
ggttcaagtg agtctcctgc ctcagcctcc cgagtagctg ggactacagg tgcgcgccac 71340 
cacacccgcc cagctaattt ttgtattttt agtagagatg gggtttcacc atgttggcca 71400 
ggatgatctc gatctcttga cctcgtgatc cgcccacctc ggcctcccaa agtgctggga 71460 
ttataggcat gagccactgt acccagctga ctcttagtca cttttaagaa ggggactgtg 71520 
ccttcatttt tcactgggcc ctgcagaata tatgcctggg ctctgggctc ttctgaacct 71580 
gtgttggctt ccatctgacc tctctgtgcc agcccaaggc tgctgctctt cctgagggca 71640 
aggagcccca tgactgcgtg ttgactcgct ggatggggct gctgagccca ctctgccaca 7 1700 
ccacgtgccc ctggcaggga gggaatccct gggtcctcac aggaacagtc agcaagccac 71760 
acctgacgcc tgctgtgggc ccatccctgc ggtgctggag aagacagaca aggcctggtc 71820 
actgcctctg cagggtcccc agtccgtgga aggagacagt aatctaggca ttttcggtgg 71880 
ggaagctgag ctgttctcgt gtcctgaagg ccaggcggga acagccgtct tcagagggaa 71940 
gggagaaaat gcacatcgca tcagtggaga agggcctgac ttccctcagc atggtggagg 72000 
gaggtcagaa aacagtcaag cttgagtatt ctatagtgtc acctaaata 72049 

<210> 10 
<211> 8705 
<212> DNA 
< 213 > Homo sapiens 

<400> 10 

ggactcaggg gcagcaggga ggtacaccca tggttagtgg gcggaccata gggggtaatg 60 
agagggtgaa tcgatggaac ctgggggaca caatcgaagt ggttccagag tcgggctgta 120 
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gaaccacccc catgattcaa tcatctccca ctgggtccct cccacagcac gtgggaatta 69900 
tgggagtaca attcaagatg agatttgggt ggggacacag ccaaacccta tcggttgcca 69960 
acatttacag taacagtgtt aggtgaacag ttgtccagtc tcctgttitg tcggacactg 70020 
tttctagcac cttccaggca gaatctcatg tatccttcac tttcgaaatg ggtactattt 70080 
catccccact tttatcaatg agaaactaaa gctcgaagag gtcaagtaag ttcctggcca 70140 
aggtcagcta gcaggctcta gaggcctcgt tctccttaga ggcagccttg ccagggccca 70200 

ggcttggcaggctgcagggcaggtgcgggcatgcccatggtagaggtgggaccattgagg 70260 
ctcagagagg gtaagtgatg agccctggcg acacagcggg gtgggtccag agtccggcct 70320 
gcatcttctg gagctggcca gtggacaggc ctttcccgtt cacagccccg gggctgctgt 70380 
gcccaccagg gcggatgtgc ctaccgaatc ccactcctct gtgtgtgtcc ctttcaggcc 70440 
ctacatcatt cgaggaatgg cgcccccgac gacgccctgc agcaccgacg tgtgtgacag 70500 
cgactacagc gccagccgct ggaaggccag caagtactac ctggatttga actcggactc 70560 
agacccctat ccacccccac ccacgcccca cagccagtac ctgtcggcgg aggacagctg 70620 
cccgccctcg cccgccaccg agaggagcta cttccatctc ttcccgcccc ctccgtcccc 70680 
ctgcacggac tcatcctgac ctcggccggg ccactctggc ttctctgtgc ccctgtaaat 70740 
agttttaaat atgaacaaag aaaaaaatat attttatgat ttaaaaaata aatataattg 70800 

ggattttaaaaacatgagaaatgtgaactgtgatggggtgggcagggctgggagaa^tt 70860 
gtacagtgga gaaatattta taaacttaat tttgtaaaac agaactgcca ttcttttgtg 70920 
ccctgtgtgc atttgagttg tgtgtccccg tggagggaat gccgaccccc ggaccaccat 70980 
gagagtcctc ctgcacccgg gcgtccctct gtccggctcc tgcagggaag ggctggggcc 71040 
ttgggcagag gtggatatct cccctgggat gcatccctga gctgcaggcc gggccggctt 71100 
tatgtgcgtg tggcctgtgc cgtcagaaag ggccctgggc ttcatcacgc tgttgctgtt 71 160 
cgtcttcctc agattcttag tctttt^ 7122Q 
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gcggccagga ggacagactg tgagctgtgg gctcggcggc tacagagtct gcctcagtgg 58860 
gcggggctga tggtgtccag gtgcctgcag cacgcaccca cccacgggac cttgctgagc 58920 
agcgtctgtc aggcagcaag attacccgag ggctgcagtg gtcctgttcc ctggcagctt 58980 
actgtctggc tgaggaggag tgatgttcac atatgcacac atgtcatgtg cacacacatg 59040 
tacatgacaa catcccacat gctcctcaaa tagcatgacc tgtacagtca cggatatagg 59 100 
gcctagggga taggaggcca agacagtcag ggaagacttt ccagaggcag tggctcctga 59160 
aaggctgtct gattcaggca ggaagggagc tgagttcaga taggaagtag caatgagtca 59220 
ttgtgtctgg ggacatggcc actccttcgc tgcagaggga cctgggctga gagctcctct 59280 
cttatggctg cagtcgggag agaagtctgt tggggggaga agggggcttc ctcaagggac 59340 
tccctgtgcc ctttggcacc ttcgtgccag gtcaggcttg aggcctgaag gcagtggtgg 59400 
gggccaccaa gggtcgcctc ctctgctggg caagttccca gtctgacggg cctgtgccgt 59460 
gggccccagc tgtgggggcg ctgttgatgc gcagccaggc ctcgccgcca gagcccgcac 59520 
gcttccattc cgctgacttc atcgacgccc tcaggatcgc tgggccggcc ctgtgggaga 59580 
gtgaatgtgg cttttgccaa agttgagtct ggagcctgga aacttcccta tgggcagcct 59640 
tgatagtgga gtggcccaag gagcccaccc agccgaccct gcccctcccg tggctggtgg 59700 
gcggcaccag gggctgcctg gctttgctcg ttcaccaaca tcacccgggc tggccagggc 59760 
gcgctcactt ctgccaccac cgagggccct gggcgaagga gtgaatacca ggctgccttg 59820 
gcagggatgt gttgagggct gtggggagtc ggacagcggc gggggtcaga ggaggaggag 59880 
ggtgcaccgt gcaggctgaa gggccacgtt accctgaggt tggccaggct ccccaggcct 59940 
agcctcccag ctcccccact ttctccccac cctccaccag tggcaaagcc agccccttca 60000 
gggcgcacgg tgtctgcccc caaggagggc ccattccgtt ggggttaatg ttggccacct 60060 
ctttctgttt gtctctggca gaaatcacca agccgccctc agacgacagc ccggcccaca 60120 
gcagtgccat cgggcccgtc attggcatca tcctctctct_cttcgtcatg ggtggtgtct 60180 
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gtggcacttg tggaaggtgc agacctgtgt gggtgtgtgg gcccagggcc cctggtccct 1560 
tcctcccttt gtagggctgg ttgtgtgctg cctggacctg gggggcacgt tcacgtggtg 1620 
aatttgtcta tttactatcc ccgctttggg gctggtgcca gcacaggccc ttgtgaaggg 1680 
ggtgcctttg tctggagtgg gactgtggcc cctccctcag cgtggtgact tctgtgtcag 1740 
ggcttcagca gggacgcaga gcccctgagt gttcggaaca agggcgtcat tgcaggagtt 1 800 
agactgtgtg tgatggaggg aggaggggca ggaggaaagg tcagaaggag agttcctggg I860 
aaggtccctg aggagcctgg tgaggtgcta actggtgtgg aggacactca gggcctgtgg 1920 
ggacatctcc tactgctggg ggccagccac aaagggaact ggccgaagtc ctgtccccgc 1980 
cttcacagcc cagcatctgg tcacaaggca ggtacttgga agggcgcggg cacctgggcc 2040 
aaaagtgcct gggttccctt tgcctttcac tgagatgacc ttcggggcag gtggctgctg 2100 
cctcccctcc tgtccccagg ttttgccaac tggccagagg aaggggtcct gggaagcagg 2160 
ggggccagaa gccctctctg caaggaaagc ccgaggggtg tgggaggaag gaaggaatgc 2220 
ccaggctggc gaggctctaa gtcaccctgg cttggctctc ctcagatcct gaacccgccg 2280 
ccctccccgg ccacggaccc ctccctgtac aacatggaca tgttctactc ttcaaacatt 2340 
ccggccactg cgagaccgta caggtaggac atcccctgca gccctccatg gccattgggt 2400 
tcccgccagc ccgtggtgga ggggcctaat ccccatgcca ctgatgaggg gaggtattct 2460 
gggtgctaat gggcaggtgc cgggcccagc cctgcctccc tctgctctgc caaccacact 2520 
aggctgcctc cccagacaag ctcagcgggc actgcatgtt gggttcagaa atcagcagaa 2580 
ctccacgttc tgagctgctc ttcaagttgc tcctatgggg gttactttta agctgggaaa 2640 
tggctgtggc gtcgaggggc cgggggcttg ggctccagag tctgactgtg tgtttgagtc 2700 
cggctgtgga aacctagcca ttgagatgcc ccctcttggt ggctctgtcc tcttaggatg 2760 
ggacaagtct gtgaaggctg ctgcagcacc caccgtagac ccctaatcgt gtgacgtcac 2820 
caggatggtc cgggctgctc acttgccaca gtggcctgtt tgagcccggg aagccaacgg 2880 
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tcggcctccc aaagtgctgg gattataggc atgagccact gtacccagct gactcttagt 5700 
cacttttaag aaggggactg tgccttcatt tttcactggg ccctgcagaa tatatgcctg 5760 
ggctctgggc tcttctgaac ctgtgttggc ttccatctga cctctctgtg ccagcccaag 5820 
gctgctgctc ttcctgaggg caaggagccc catgactgcg tgttgactcg ctggatgggg 5880 
ctgctgagcc cactctgcca caccacgtgc ccctggcagg gagggaatcc ctgggtcctc 5940 
acaggaacag tcagcaagcc acacctgacg cctgctgtgg gcccatccct gcggtgctgg 6000 
agaagacaga caaggcctgg tcactgcctc tgcagggtcc ccagtccgtg gaaggagaca 6060 
gtaatctagg cattttcggt ggggaagctg agctgttctc gtgtcctgaa ggccaggcgg 6120 
gaacagccgt cttcagaggg aagggagaaa atgcacatcg catcagtgga gaagggcctg 6180 
acttccctca gcatggtgga gggaggtcag aaaacagtca agcttgttgc tgggtgacag 6240 
tgcatttaat aatcaaaata taggctgggt acggtggctc atgcctgtaa tcccagcact 6300 
ttgggaggct gaggcaggtg gatcacttga ggccaggagt ttgagaccgg cctggccaac 6360 
atggcaaaac ctcaactact aaaatacaaa aactagccgg gcgtggtggt gcacgcctgt 6420 
aatcccagct acttgggagg ctgaggcagg agaattgctt gaacctggga ggcggaggct 6480 
gcagtgagcc gagattgtgc cactgcactc cagcctgggc aacagagcaa gactctgtct 6540 
caaaaaaaaa aaaaaaaaaa gcaatacaaa atacaaatat cactttcact aaaagaaggg 6600 
atggaagacc caaaacaaac agaaaacaac aaaatggcag gagtaagtcc ccacttatca 6660 
ataataacat tgactgtaaa taggctaagc tctgcaatca aaagagtggg ccaggagcgg 6720 
tggctcacgc ctgtaattcc aacgctttgg gaggctgagg cggatggatc atttgatgtc 6780 
acgagtttta agaccagcct ggccaacaag gtgaaacccc atctgtacta aaaatacaaa 6840 
aattagccag gcggtagtgg cacgcacctg taatcccagc tacttgtgag gctgaggcag 6900 
gagaatcact ggaggctggg aagcggaggt tgctgtgagc caagatggag ccactgcact 6960 
cccacctggg cgacagagtg agatcctgtc ttaagaaaaa aaagagtgga tgaatggatc 7020 
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gtaatgaacg tgaccaggag ctgcttactg aggacgcact ggatgatctc atcccttott 660 
ttctactgac tggtcaacag acaccggcgt tcggtcgaag agtatctggt gtcatagaaa 720 
ttgccgatgg gagtcgccgt Ggtaaagctg ctgcacttac cgaaagtgat tatcgtgttc 780 
tggttggcga gctggatgat gagcagatgg ctgcattatc cagattgggt aacgattatc 840 
gcccaacaag tgcttatgaa cgtggtcagc gttatgcaag ccgattgcag aatgaatttg 900 
ctggaaatat ttctgcgctg gctgatgcgg aaaatatttc acgtaagatt attacccgct 960 
gtatcaacac cgccaaattg cctaaatcag ttgttgctct tttttctcac cccggtgaac 1020 
tatctgcccg gtcaggtgat gcacttcaaa aagcctttac agataaagag gaattactta 1080 
agcagcaggc atctaacctt catgagcaga aaaaagctgg ggtgatattt gaagctgaag 1 140 
aagttatcac tcttttaact tctgtgctta aaacgtcatc tgcatcaaga actagtttaa 1200 
gctcacgaca tcagtttgct cctggagcga cagtattgta taagggcgat aaaatggtgc 1260 
ttaacctgga caggtctcgt gttccaactg agtgtataga gaaaattgag gccattctta 1320 
aggaacttga aaagccagca ccctgatgcg accacgtttt agtctacgtt tatctgtctt 1380 
tacttaatgt cctttgttac aggccagaaa gcataactgg cctgaatatt ctctctgggc 1440 
ccactgttcc acttgtatcg tcggtctgat aatcagactg ggaccacggt cccactcgta 1500 
tcgtcggtct gattattagt ctgggaccac ggtcccactc gtatcgtcgg tctgattatt 1560 
agtctgggac cacggtccca ctcgtatcgt cggtctgata atcagactgg gaccacggtc 1620 
ccactcgtat cgtcggtctg attattagtc tgggaccatg gtcccactcg tatcgtcggt 1680 
ctgattatta gtctgggacc acggtcccac tcgtatcgtc ggtctgatta ttagtctgga 1740 
accacggtcc cactcgtatc gtcggtctga ttattagtct gggaccacgg tcccactcgt 1800 
atcgtcggtc tgattattag tctgggacca cgatcccact cgtgttgtcg gtctgattat 1860 
cggtctggga ccacggtccc acttgtattg tcgatcagac tatcagcgtg agactacgat 1920 
tccatcaatg cctgtcaagg gcaagtattg acatgtcgtc gtaacctgta gaacggagta 1980 
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ggtgggtgct tcgtggcgtg gttctgaaac ttcgttggaa gtgtgtggac agtgccttgc 3420 
ctgttctctg tgggacccta tttagaaacg aggtctgagt tactgggggt catcactgtg 3480 
ttctgatggc ccagctgtgt ggaggccgcg gtgcagcccc atccaaggag ccagggccct 3540 
gggtctagcc gtgaccagaa tgcatgcccc ggaggtgttt ctcatctcgc acctgtgttg 3600 
cctggtgtgt caagtggtcg tgaaactctg tgttagctct tggtgttcct gaaagtgccc 3660 
ccgggtctca ggcctcagaa ccagggtttc ccttcatctc ggtggcctgg gagcatctgg 3720 ' 
gcagttgagc aaagagggcg attcacttga aggatgtgtc tggccctgcc taggagcccc 3780 
ccggcacggt gctggggcct gaagctgccc tcgggtggtg gagaggaggg agcgatgaag 3840 
tggcgtcgag ctgggcagga agggtgagcc cctgcaaggt gggcatgctg gggacgctga 3900 
gcagcatggc cagcagctgg gtctgcagcc tggtacccgg cgggacttgt ggttggggct 3960 
ggtttgtggc caggagaggg gctggcagga gacaaggggg actgtgaggc agctcccacc 4020 
cagcagctga agcccaatgg cctggctgtg tggctctcag ctgcgtgcat aacctctcag 4080 
tgcttcagtt ctctcatttg taaaatgagg aaacaaacag tgccagcctc ccagaggtgt 4140 
catgaggatg aacgagtgac catgtagcat gggctgggtg cgtgtcacct aacatcacca 4200 
gcctttgcaa ggagagccct gggggcctgg ctgagtattt cccttgcccg gcccacccca 4260 
ggcctagact tgtgcctgct gcaggccctt gacccctgac cccattgcac ctgtctccac 4320 
aggagccgag gaggtgctgc tgctggcccg gcggacggac ctacggagga tctcgctgga 4380 
cacgccggac ttcaccgaca tcgtgctgca ggtggacgac atccggcacg ccattgccat 4440 
cgactacgac ccgctagagg gctatgtcta ctggacagat gacgaggtgc gggccatccg 4500 
cagggcgtac ctggacgggt ctggggcgca gacgctggtc aacaccgaga tcaacgaccc 4560 
cgatggcatc gcggtcgact gggtggcccg aaacctctac tggaccgaca cgggcacgga 4620 
ccgcatcgag gtgacgcgcc tcaacggcac ctcccgcaag atcctggtgt cggaggacct 4680 
ggacgagccc cgagccatcg cactgcaccc cgtgatgggg taagacgggc gggggctggg 4740 
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acctcggtgt gcggttgtat gcctgctgtg gattgctgct gtgtcctgct tatccacaac 2040 
attttgcgca cggttatgtg gacaaaatac ctggttaccc aggccgtgcc ggcacgttaa 2100 
ccgggctgca tccgatgcaa gtgtgtcgct gtcgacgagc tcgcgagctc ggacatgagg 2160 
ttgccccgta ttcagtgtcg ctgatttgta ttgtctgaag ttgtttttac gttaagttga 2220 
tgcagatcaa ttaatacgat acctgcgtca taattgatta tttgacgtgg tttgatggcc 2280 
tccacgcacg ttgtgatatg tagatgataa tcattatcac tttacgggtc ctttccggtg 2340 
atccgacagg ttacggggcg gcgacctcgc gggttttcgc tatttatgaa aattttccgg 2400 
tttaaggcgt ttccgttott cttcgtcata acttaatgtt tttatttaaa ataccctctg 2460 
aaaagaaagg aaacgacagg tgctgaaagc gagctttttg gcctctgtcg tttcctttct 2520 
ctgtttttgt ccgtggaatg aacaatggaa gtccgagctc atcgctaata acttcgtata 2580 
gcatacatta tacgaagtta tattcgatgc ggccgcaagg ggttcgcgtc agcgggtgtt 2640 
ggcgggtgtc ggggctggct taactatgcg gcatcagagc agattgtact gagagtgcac 2700 
catatgcggt gtgaaatacc gcacagatgc gtaaggagaa aataccgcat caggcgccat 2760 
tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta 2820 
cgccagctgg cgaaaggggg atgtgctgca aggcgattaa gttgggtaac gccagggttt 2880 
tcccagtcac gacgttgtaa aacgacggcc agtgaattgt aatacgactc actatagggc 2940 
gaattcgagc tcggtacccg gggatcctct agagtcgacc tgcaggcatg caagcttctc 3000 
ttgtgccggt tgtacgctgt caggtcacac tggtgagtta ggcagggcac agatgcccag 3060 
agcagaggga actttccttg gggattcaac acgtgcaagt cttaggggct ggcaaatcct 3 120 
gccctcagct agagaggggg cttttatttg agaccagaat cacctgagca tcctcctgtc 3 180 
cccagctgtg tccagcctgt ctgcagggac atcctgagag gaccaggctc tcccctcatc 3240 
cacctgccta agtgccactc tgaaccctgt ccacctgtgc cgtggagggg cgtgacctca 3300 
agctgctcag ccagcagcag gcttggccct ggggggcagc agagacccag gtggctgtgg 3360 
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gcctggagcc agggccaggc caagcacagg cgagagggag attgacctgg acctgtcatt 4800 
ctgggacact gtcttgcatc agaacccgga ggagggcttg ttaaaacacc ggcagctggg 4860 
ccccaccccc agagcggtga ttcaggagct ccagggcggg gctgaagact tgggtttcta 4920 
acaagcaccc cagtggtccg gtgctgctgc tgggtccatg cgtagaaagc cctggagacc 4980 
tggagggagc cctttgttcc cctggcttca gtttcctcat ctgtagaatg gaacggtcca i 
tctgggtgat ttccaggatg acagtagtga cagtaagggc agcctctgtg acactgacca 
cagtacaggc caggcctctt tttttctttt tttttttttg agatggagtc tcactctgtc 5 160 
gcccaggctg gagtgcagtg gtgtgatctc agctcactac aacctctgcc tcctgggctc 5220 
aagtgattct cctgcctcag cctcctgagt agctgggatt acaggtgcct gccactgtgc 5280 
ttggctaatg tttgtatttt tggtagagat ggggtttcac cgtcttggcc aggctggtcg 5340 
caaactcctg acctcaggtg atccacctgc ctcagcctcc caaagtgctg ggattacagg 5400 
catgagccac cacgcccggt caggccaggc ctcttttgaa cactttgcac accatgggtc 5460 
ttttcatcca ggggggtagg tacagttgta cagttgagga cactgaagcc cagagaggct 5520 
cagggacttg cccagggtca cacagcagga tgtggcaggt gtggggctgg gcctggcagc 5580 
gtggctccag ctttccagca tagaaatctg tgaaagcaga tagtttgtcg gtcggtaggg 5640 
gagactttct gagacccgcc ccagcggctc agagggtagt agccaggggc cttectgggg 5700 
gctcataacc cagaacactg aatgggaaaa ccctgatgga ggaggcgcag tggagctgtg 5760 
ggtgccgatg ggaagtccca gaggagctgg gaggtcagta gcggtgctgc cctctgtgga 5820 
gcacttagtg ggcaccaggt gtgtttccag gttcatggcc ctgggacctg aagctcagaa 5880 
ggtgaagtaa cttgcccagg gcacccgtcg ggcagcggcg ggcagaggat ttgtgggctg 5940 
tggagcctgt gctcgtggcc cagccctggg ggttgtgagt gtgctggccg gggagctttt 6000 
cctgcaagtg gactggtgtc taggagccag catgtcaggc agcaggcagc gggagtgcag 6060 
caggcagcgg gagcacagca ggcagagggc ggggctcga^ cagccatccg tggaccctgg 6V. 
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ggcacggagg catgtgggag agggctgctc catggcagtg gctgaagggc tgggttgtgc 6180 
cccgaggagg gtggatgagg gtaagaagtg gggtccccag gggctttagc aagaggaggc 6240 
ccaggaactg gttgccagct acagtgaagg gaacacggcc ctgaggtcag gagcttggtc 6300 
aagtcactgt ctacatgggc ctcggtgtcc tcatctgtga aaaaggaagg gatggggaag 6360 
ctgactccaa ggcccctcct agccctggtt tcatgagtct gaggatccca gggacatggg 6420 
cttggcagtc tgaSctgtga ggtcgtgggg tccagggagg ggcaccgagc tggaagcggg 6480 
aggcagaggg gctggccggc tgggtcagac acagctgaag cagaggctgt gacttggggc 6540 
ctcagaacct tcacccctga gctgccaccc caggatctgg gttccctcct tggggggccc 6600 
cagggaacaa gtcacctgtc ctttgcatag gggagccctt cagctatgtg cagaaggttc 6660 
tgctctgccc cttcctccct ctaggtgctc agctcctcca gcccactagt cagatgtgag 6720 
gctgccccag accctgggca gggtcatttc tgtccactga cctttgggat gggagatgag 6780 
ctcttggccc ctgagagtcc aagggctggt gtggtgaaac ccgcacaggg tggaagtggg 6840 
catccctgtc ccaggggagc ccccagggac tctggtcact gggcttgccg ctggcatgct 6900 
cagtcctcca gcacttactg acaccagcat ctactgacac caacatttac aaacaccgac 6960 
attgaccgac accgacattt accgacactg acatttacca acactgttta ccaacactga 7020 
catctactga cactggcatc taccaacact gacatttacc gacactgaca tttaccaaca 7080 
ctatttacca acactgacat ctactgacat tggcatctac caacaccaac atttaccgac 7140 
accaacattt accaacactg aaatttaccg acaccgacat ttaccgacac cgtttaccaa 7200 
caccgacgtt taccgacacc gacatttacc gacactgata tttaccaaca ctgacatcta 7260 
ctgacgctgg catctactga caccgatgcc agcatctacc aacaccgaca tttaccaaca 7320 
ctgacattta ctgacactga tatctactga cactggcatc tactgacacc aacatttacc 7380 
aacaccagca tctaccaaca ccgacattta ccaacaccag catttaccaa caccgatgtt 7440 
taccaacgcc gacgtttacc gacgccagca tctaccaaca ctgacattta ccgacaccga 7500 



WO 01/92891 PCT/US01/16946 

167 

catttaccga cactgacatt tactgacact gacatctact gatactggca tctaccgaca 7560 
ctgatattta ccaacgccag catctactga cactgatgtt taccaacacc gacatttacg 7620 
agcaccgaca tttactgaca ccaatattta ctgacatcaa catttagcca tgtgatgggg 7680 
gccggcttgg gggcaggcct tgctcttggc actggggatg ctgcagagac cagacagact 7740 
catggggtca tggacttctg cttcttctcc agcctcatgt actggacaga ctggggagag 7800 
aaccctaaaa tcgagtgtgc caacttggat gggcaggagc ggcgtgtgct ggtcaatgcc 7860 
tccctcgggt ggcccaacgg cctggccctg gacctgcagg aggggaagct ctactgggga 7920 
gacgccaaga cagacaagat cgaggtgagg ctcctgtgga catgtttgat ccaggaggcc 7980 
aggcccagcc accccctgca gccagatgta cgtattggcg aggcaccgat gggtgcctgt 8040 
gctctgctat ttggccacat ggaatgcttg agaaaatagt tacaatactt tctgacaaaa 8100 
acgccttgag agggtagcgc tatacaacgt cctgtggtta cgtaagatgt tatcattcgg .8160 
ccaggtgcct gtagacacag ctacttggag actgaggtgg gaggatcgct ggagtccaag 8220 
agtttgaggc cagcccgggc aaaggggaca caggaatcct ctgcactgct tttgccactt 8280 
actgtgagat ttaaattatt tcacaataca aaattaagac aaaaagttaa tcacatatcc 8340 
actgccctgc ttaagacaga aaacatgggt gttgttgaag ccagaggcag ctgctggcct 8400 
gagtttggtg attggttcct aagcagttga aggcagtttt gtttttccat agatgtctgt 8460 
tctccctttg ctgggtgcag cctcgccctg ctgctgtggt cgggtttcag tggcctcgtc 8520 
ccgtggacgc agcctcgccc tgccgctgtg gtcgggtttc agtggcctcg tcccgtggac 8580 
gcagcctcgc cctgctgctg tggtcgggtt tcagtggcct cgtcccgtgg acgcagcctc 8640 
gccctgccgc tgtggtcggg tttcagtggc ctcgtcccgt ggacgcagcc tcgccctgcc 8700 
gctgtggtcg ggtttcagtg gcctcgtccc atgggcgtgc tttggcagct ttttgctcac 8760 
ctgtggagcc tctcttgagc LLLlLlgLll gttgtttgtt tttgtttgat tttgtttgat 8820 
tgtttgtttt tgttgtcgtt gttgttgccc aggctggagt gcagtggcgc gatctcagct 8880 
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cactgaaacc tctgcctcct tgggttcatg ccattctcct gcctcagcct cccacatagc 8940 
tgggattaca agtgcccgcc accacgcctg gctaaatttt gtatttttag tagacagggg 9000 
gtttcaccat gttggtcagg ctggtctgga actcctggtc tcacatgatc cacctgcctc 9060 
ggcctcccaa agtgttggga ttacaggcgt gagccaccgc gcccagcctc tgttgagcat 9120 
attttgaggt tctcttggtg ccagtgatat gtacatgtgt ccccatcgca ccatcgtcac 9180 
ccattgaggt gacattggtg cctctcctcg gggtggatgc ctccctctgt ttccagcaac 9240 
ttctgaagga ttttcctgag ctgcatcagt ccttgttgac gtcaccatcg gggtcacctt 9300 
tgctctcctc agggctccca ggggaggccc gaatcaggca gcttgcaggg cagggcagga 9360 
tggagaacac gagtgtgtgt ctgtgttgca ggatttcaga ccctgcttct gagcgggagg 9420 
agtctcagca ccttcagggt ggggaaccca gggatggggg aggctgagtg gacgcccttc 9480 
ccacgaaaac cctaggagct gcaggtgtgg ccatttcctg ctggagctcc ttgtaaatgt 9540 
tttgtttttg gcaaggccca tgtttgcggg ccgctgagga tgatttgcct tcacgcatcc 9600 
ccgctacccg tgggagcagg tcagggactc gcgtgtctgt ggcacaccag gcctgtgaca 9660 
ggcgttgttc catgtactgt ctcagcagtg gttttcttga gacagggtct cgctcgctca 9720 
cccaggcgag agtgcagtgg cgcaatcacg gctcgctgta gcctcaatct ccctgggctc 9780 
aggtgatcct cctgcctcac cctctgagta gctgggacta cagacacata ccaccacacc 9840 
cagctagttt ttgtgtattt tttgtggggg gagatggggt ttcgctgtgg tgcccaagct 9900 
gatctcaaac tcctgaggca caagcgatcc acctgcctcg gcctcccaaa gtgctgggat 9960 
gacaggcatc agccgtcaca cgcagctcaa tgattttatt gtggtaaaat aaacatagca 10020 
caaaattgat gattttaacc attttaaagt gaacagttca ggctgggcgt ggtggcttat 10080 
gcttgtaatc ccagtacttt gagaggctga ggtgggcaga tcacctgagg tcaggagttt 10140 
gagaccagcc tggccaacat gatgaaatcc agtctctact aaaaatacaa aaattagccg 10200 
ggcatggtgg caggtgcctg taatcccagc tactcgggag gctgaggcag gagaatcgct 10260 
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tgagcccggg aggtggaggt tgcagtgatc tgagatcatg ccactgcact ccaatctgtg 10320 
tgacagagca agactctgtc ttgaaaaata aataaataaa aaaaatttta aaaagtgaac 10380 
aattcagggc atttagtatg aggacaatgt ggtgcaggta tctctgctac tatctacttc 10440 
tagaacactt tcttctgccc tgaaggaaac cccatgccca ccggcactca cgcccattct 10500 
cccctctctc ccagcctctg tcaaccacta atctactttc tgtctctggg ggttcacttc 10560 
ttctggacgt tttgtgtgac tggaatcctg caatatgtgg tccctgcgtg tggcttcttt 10620 
ccatagcatt gtgttttcca gattcaccca cacattgtcg cacgttatca gaatctcatt 10680 
cctgactggg tgcagtgggt taggcctgta atcctaacat tctgggaggc caaggcggga 10740 
cgatcacttg aggcaggagt ttgagaccag cctggccagc ctagcaagac cccagctacc 10800 
aaaaaatttt aaaagttaac tgaacgtggt ggtggtgggc acttgtggtt cccagctacc 10860 
tgggaggctg aggtgggagg atcgcttaag cccaggaggt caaggctgca gtgagctatg 10920 
atcgcaccac tgcactccag cctggacaac agagcaagac cctgtctgaa aaaaaaaaca 10980 
aaaaaaaaag ttcctttctt tttgtggctg gatgacatcc cattgtatgg ccacagcaca 1 1040 
ttttgtttgt ctgtttatcg ggtggtgggc agtggtttcc accttttgtc tcctgtgaat 11100 
aatgctgctg tgaacatttg aattcaagtt tttgtttgaa cacctgttgt gaattatttg 1 1 160 
gatatatgtg taggggtagg attgctgagt cctatggtaa tgttaggttt gacttactga 1 1220 
ggaaccatta aactgttttc aacagtggct gcgccgttct gcatccccac cggcagtgtg 1 1280 
tgagggttct gactttacct cctcacaaac gcttcttttc catttaaaaa aatattcagc 11340 
caggtgctct ggctcacgcc tgtaatccca gcactttggg aggccgtggc gggcggatca 1 1400 
cctgaggtca ggagttcgag acgagcctgg ccaacatggt gtaaccccat ctctaccaaa 1 1460 
aatataaaaa ttagccgggt gtggcagcgg gcgcctgtaa tcccagctac ttgggaggct 1 1 520 
gaggcaggag aatcacttga acccgggagg cagaggttgc agtgagccaa gatcgcgcca 11580 
ctacactcca gcctgggtga caagagtgaa actccatcta aaataaaaca aaaataaaaa 1 1640 
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tccagcacac actcaccctg tctgtgcacc tgtttttgtg tccgtaagtg ggtatttact 32640 
caccttacga gtgagccact gtgggaattc agggaggtgg cgcagtgacc acccctggag 32700 
ggatatgtgt gtggcagggg tcgagggtct cgcccttccc tgcttcctgc gcgtggcttt 32760 
ctccaggacg gggagggctg agctgaagag gtggggacag ttgcgtcccc ccgccaccca 32820 
ctgtcctgcg gtgagagcag actcactgag cctgcccttc tcccttgtgc cttccagcta 32880 
catctactgg accgagtggg gcggcaagcc gaggatcgtg cgggccttca tggacgggac 32940 
caactgcatg acgctggtgg acaaggtggg ccgggccaac gacctcacca ttgactacgc 33000 
tgaccagcgc ctctactgga ccgacctgga caccaacatg atcgagtcgt ccaacatgct 33060 
gggtgagggc cgggctgggg ccttctggtc atggagggcg gggcagccgg gcgttggcca 33120 
cctcccagcc tcgccgcacg taccctgtgg cctgcaagtt ccccaacctg gcaggagctg 33180 
tggccacacc cacgactgcc cagcagcctc accctctgct gtgggagttg tccccgtcca 33240 
cccctgggtg cctttgctgc agttatgtcg ggagaggctc tggtgacagc tgtttcctgt 33300 
gcacctgctg ggcactaggt cccagctaat ccctgtgcca ggactctaat ttcaccctaa 33360 
cacacatggt ggttttcatt gctggggaag ctgaggcctg agcacatgac ttgccttagg 33420 
tcacatagct ggtgagttca ggatccccca gagataccag ggccagcact cgatccccac 33480 
ccagccctga accccaccat gtgctgggat tgtgctggga gtgtccacac gcctgggacc 33540 
ccagggctgg tgctctcatc tcctttttcc agatcatgag aatgaggctc agggaagttt 33600 
gaaaaaaacc tatcccaagt cacacagcaa caggagcagg atttgaaccc agaaaagggg 33660 
accgcacact ctgttctgct agagtagtta gctgtcctgg gtgatatggc aggtgacagg 33720 
ggcaactgtg cttaacaaag gaacccccat cccccctgcc aagttgggag actagaaggt 33780 
caggggcaga agctctgaag ggccaggtgc agtggctgac acctctaatc ccagcacttt 33840 
gtgaggccaa ggcgggcaga tgatttgagc ccaggagttc aagatcagcc tgggtaatgt 33900 
agtgagacgc catctctaca aaaaaatttt ttaaaaatta gctgggcatg gtggttcatg 33960 
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cctgtagtcc aagctacttg ggaggctcag gtgggaggat tgcttgagcc caggaggttg 34020 
aggttgtggt gagctgtgat catgccactg cactccagcc tgggcaatag agtgagaccg 34080 
tctccaaaaa aaaaaaaaga agaagaaaaa gaagctctga ggctccaagt ccccaggcac 34 140 
cccttggctt gagggcagac aagggaggag agggtcacct gggcagccct gacttttgtc 34200 
ccctggcaaa gggaccttca gtgaccttgg ccctaggaga gcctctgagc acgtcagcca 34260 
tgtcgaaccg ctcaggaagg gcagcaagaa tttggcttct gacctctgcc tctcctactc 34320 
gccatctgca ctgggtgtgg ttgtgcccat tttacagatg aggaggctgg ggcatcgacc 34380 
agctgaatgc cttgtcccag gtactgcgta ggcagagctg gcagttgaac cccgtgtcct 34440 
ggttgtcgct gggggtgggc tgcaccctga cttgtgaggc cagtagcaag gtttgcacgt 34500 
gacttcgtga ccgtcaccca gctctgcagc acatcccgtg acccagctca tccaggccgc 34560 
atgcaaacct gttgccaggc gagaaaccag tcaccgcaca gctgtggttg cctgaaatga 34620 
ttaagctcat taatcacccc ggagtgagga cagactcaga tgaaaaccag caaaagccct 34680 
ggaaactcat gtgaccctgc caatgagggc ggccatgtgc attgcagcct ggccgtcact 34740 
cctcggtacg tgttttggac ttaaacgctc cggatgttta ctgagtgctt gattaataac 34800 
atggaaggcc tggtctcatt gctgtgggag tgaaggatgc acagccaggc ctgacatgat 34860 
gagaacaaga acctggagtc tcgctgcctg ggtggtaatc ctggccctgc cacttagcaa 34920 
ctgtgtgact gtagccaggt cacttaattt tgctagatcc tgcctgcgct tcagtggatc 34980 
ttgctggttt tccaaggtgg ccaaacactt taaggcattc atgtggtcgc taggctgcag 35040 
ggttgaaccc tggctcaccc cgcagggcgc cgtgtgctct gtggcctggc tgtgcctttg 35100 
ctgacaccgt gcccgtgtgt gttcatgcag gtcaggagcg ggtcgtgatt gccgacgatc 35160 
tcccgcaccc gttcggtctg acgcagtaca gcgattatat ctactggaca gactggaatc 35220 
tgcacagcat tgagcgggcc gacaagacta gcggccggaa ccgcaccctc atccagggcc 35280 
acctggactt cgtgatggac atcctggtgt tccactcctc ccgccaggat ggcctcaatg 35340 
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actgtatgca caacaacggg cagtgtgggc agctgtgcct tgccatcccc ggcggccacc 35400 
gctgcggctg cgcctcacac tacaccctgg accccagcag ccgcaactgc agccgtaagt 35460 
gcctcatggt cccccgcacc tcactccctc gttagatcag gctggttctg ggagctgacg 35520 
ctgaaaggag cttctcatct ggggttcctg ggtgtacata gatggttggg taggttgtgc 35580 
actgcacaag ctgcatgatg ctacctgggg gtccaggtcc aggctggatg gacttgttgc 35640 
ttcatcagga catagataaa tggccaaaac tcctcagctg gaaggtcctg ggcaggatct 35700 
ttgggtgtga aaaccagtca caggggaagg gtgcttgctc atactgccag cacagtgctg 35760 
agtgctttcc atagcgctcg tttactcctc aagcctggag ggtggggagt agcatggtcc 35820 
catttcacgt acaaggaacc cgatgcacag agaggtgtgg caacccatcc aaggccatac 35880 
aaaggggtg ggttgagccg gggttgactg tggcaggctg gctcaagagt ccctgctcct 35940 
gaacccttgc caggcagcct ggcatcagcl cggggaattt ttgccctgac ccttggaagc 36000 
aagtgggcct ctttgttctc atgtcagtga tgagaagagt gactttccta tggcccctct 36060 
ggagtacagg tgtttcctgt tggcgggctc ttcccccatg acatcagcag cgagctggtt 36120 
atgattccct acgcagaact tgatagttta taaagctctt tgtcatccag gccccgttgg 36180 
agtctcacgc agacctggtc gcaggcgggg ctggtcttgc ctgtcccagc tgcatggatg 36240 
gggaacttga ggcttgcaaa ggttaagggg ctgttcgagg cccacgctgg caggagatgg 36300 
gcctgggcca gagtctggga cttcccatgc ctgggctgtc tttggtcctg ttgctcacca 36360 
tccctccctg gggccatgac cttagagagc caaatggagg tgcaggtaac ccacggcaag 36420 
gaggggttgc catgactcag agtccccgtc ctgtggccgg cagtacctgg tgcaacgact 36480 
tggatttcag accagccact gtagcccgct gacggtgcgc tcgaagtgcc acagcttctg 36540 
aagccaggca ggactcaggc caggagactc tgttagctgt tgagagggag aggccaacgg 36600 
atgttctggt tctgctagag agctggttct tcggatcctg gtaccagtgc actgagagga 36660 
ggcccagctt gattctgggg ctgccttgtg gtggcatgtg ctgctcactg acaccctcga 36720 
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gccagggcac agccctcaaa atctccatca atgacatgta agaaaagaga ggaacctggg 38160 
aaatagcaaa gtgccttttg cacattaaat ggttagctat atcccacaat actgtgcatt 38220 
cgtaaacgtt aatgctgcaa taaatacggc acttcacctt gggaagatct ggagttggct 38280 
tatgagtgtg gaagggtgta gcgcatgagt ttttgtgaaa cactggaagg aggattgtgg 38340 
gaaatcaaat ggaaagttct caccccaggc gtggagaaga gtgggtcatg gccccagcag 38400 
tgagcccagg gaggtcagag acggaggtgt gtgtgtgggt gtgaccctgc gcagttccct 38460 
gccggctgta gttttttgca ttcgcttaat gtttctcgtg gaggaaattg tgcatgagca 38520 
aatgtgaaac cgtgctgtgc tcaaattgtc ctaatacatc attgcattgg aacagattgg 38580 
ctttnttttt tttttttttt tttttttttt tttgaaatgg agtctcactc tgtcaccagc 38640 
ctggagtgca gtggcatgat cttggctcac tgcaaccttt gcctcctatg ttcaagtgat 38700 
tttcctgcct cagcctcctg agtaactggg attacagggc atgagccacc gcggccggcc 38760 
agatttgcat ttttgaaaca actgctaggc tgggcgcggt ggctcacacc tgtaatccca 3 8820 
gcactgtggg aggccgaggc aggtggatca cctgaggtca ggggttcgag accagcctgg 38880 
ccaacatggt gaaaccccgt ctctactgaa tatacaaaaa tcagctgggt gtggtggcgg 38940 
gtgcctgtaa tcccagctac tcaggaggct gaggcaggag aattgcttga acccaggagg 39000 
cagaggttgc ggtgagccga gatcacacca ttgcactcca gcctgggcaa caagagcaaa 39060 
actccatctc aaaaaataaa aaatagaaaa acaagtgctg tagcggaagt gagcactttg 39120 
cggagtcagg cttgtgtggc ctgttccaca aatgatgtgc tcacggtggc ctcaggccca 39180 
cctggagtct gcagcatggg gcacaacagg ttcattagtg tagaattcca ggacaggcct 39240 
ggctcctaag cagccttctt ttacaaaaac tgcagagccc gcctgtatcg tagcactttg 39300 
ggaggccgaa gtgggtggat cacgaggtca ggagttcaag accagcctgg ccaacatggt 39360 
gaaaccccat ctctactaaa tatacgaaaa ttagctgggt gtggtggcac gcgcctgtag 39420 
tcccagctac tcgggaggct gaggcagaat tgcttgaacc tgggaggtgg aggttgcagg 39480 
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gatctgagac catgtcattg cactccagcc tgggcaacag agcgagacgc catctcaaaa 39540 
aaaaaaaacc tacagagcca cacggcctct ttctccaccg agtgttggtg tgggagcttg 39600 
tgttattgtg gtgaaatctt ggtactttct tgaggcagag agaggctgag cgcctggaga 39660 
gactttcaca tgggtcgcca tgtccgccgt cggtttcgct gttgtgctcc ccatctgaag 39720 
gctggtgccg tccagacagg ctggacgccc ctttccacca gatccttcct cccgcagcag 39780 
tttctagtta cgttgtactg tgaggtctgt gtccttggtt gatggcaaaa gtcagccgaa 39840 
ttgaaattca gagccatgcc tggctccctg gagcttctct cctgggcagc tgtgatcatt 39900 
gcctctgctg tggtgtgggt ggtggaaatg gattcctttc atcttgcttg ctacaggtga 39960 
ctgtcacgtg gagtcctttg gagagaggga cgtgttaatt gatggatgtg gctcccatgc 40020 
tgagaaagct cctgggcgta cattgcctta gagtttcatt ggagctgcgt tcttttatgg 40080 
tgtctgctag gcagaagtga tgaagacttg gaagaaaacc cagaaggttt tccacttaat 40140 
ttggaaaatg tgcttttccc ctcctgtgtc ttttgctaag gtccagcctc ctgcagcctc 40200 
cccgctctgt ggactctggc tttgattctt tattaggagt ccccctgctc ccccaaaaga 40260 
tggtgtctaa attatcatcc aattggccga ggttttgttt tctattaatt gtttttattt 40320 
tttattgtgg taaatttata taacataaaa tttgccattt taattgtttt gttattgttg 40380 
tttttgagac agggtctcac cccagtgccc aggctggagt gcagtggtgc gatcatggct 40440 
cactgcagcc tcagcctcca gggctccagt gatcctctca cctcagcctc tctagtagcc 40500 
gggactacag gcatacacta ccacatctgg ctgatttttt gtattttttt tttattgtag 40560 
agacccgcta tgttgcccag gctggtctca actcctggac tcaagccatc ctcccacctc 40620 
accctcccaa agtgctggga ttacaggcat gagccacaac acccagccat tttaattttt 40680 
LLLLLLLLLL ttgagatgga gtctcactct atcgcccagg ctggagtgca gtggcgtggt 40740 
atcaactcac tgcaacctct gcctcccagg ttcaagcgac tctcctgcct cagcctcctc 40800 
ccgagtagct gggattacag gtgcccatca ctatgcctgg ctaatttttg tattttttag 40860 
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ggagtgtctt ctctcgggct tgttgactgt gcccggtttt ccgcagttca ctggtgcaca 36780 
cataggcaca tagcaaaccg cacacacagt cgtgggtatg agtttcacta cattccacca 36840 
ccagtgttca ctaccattac ctgccttccg tcttaagtgt tcatcattta aaaataaatt 36900 
tattgggctg gacgcggtgg ctcatgactg ttatcccagc actttgggag gctgaggcgg 36960 
gcagatcacc tgaggtcagg agttcaagac cagcctggcc aatatggtga aactccatct 37020 
ctactaaaaa tacaaaatta gctgggcatg gtggggcatg cctataatcc cagctactca 37080 
ggaggctgag gcaggagaat ggcgtgaacc cgagaggcag agcttacagt gagcccagat 37140 
agcaccactg cagtccagcg tgggcaacag tgcgagactc catctcaaaa aaaaaataaa 37200 
taaataaaag aaaaataaat ttatgatcta tttcaaaaat aacacatgta ctttgaaaca 37260 
gcagagacac atatgacacg gagaatgaaa ttccccatag cgcaccccca agagacagcc 37320 
ctggtccccc cgtctttccc gtggacctcc agcggggcag atgctgagcc gcctgttgtc 37380 
gagtggcatg ctatcccgtc ctccagctcc tctgtggctt acagacaccc acctgcagcc 37440 
ctgtctttgc ctcctctagc gcccaccacc ttcttgctgt tcagccagaa atctgccatc 37500 
agtcggatga tcccggacga ccagcacagc ccggatctca tcctgcccct gcatggactg 37560 
aggaacgtca aagccatcga ctatgaccca ctggacaagt tcatctactg ggtggatggg 37620 
cgccagaaca tcaagcgagc caaggacgac gggacccagg caggtgccct gtgggaaggg 37680 
tgcggggtgt gcttcccaag gcgctcctct tgctggtttc caggctgctg cccctgtcct 37740 
tagcagaggg aggaaacaga ggatggctct gggtgaatga tgacttgggc ttcgattatg 37800 
tagtcacagg gtatgaccct gagatgcgtg gaaccccgag actgtgatta tatgtagaaa 37860 
ctgggtttcc ccgttgttta agtagtcatg gtggggtcag accccacagg acttttgtct 37920 
tttcaagaaa gaaaatggtc gtgtgtcatg caggggtagt tggtactggt taatccaggt 37980 
ttatccttta ttttgtggga actgtacagt catttctgct acaatgctgt atatgctctt 38040 
ctgaaagaca cctatgcaaa atcgcacagt aaaaatgaca caactcatag ggaaagcggg 38100 
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cagagacggg gtttcaccat gttggccagg ctggtcttga actcctaacc tggtgatccg 40920 
cccgcctcgg cctcccaaaa tgctgagatt acaggtgtga gccaccgtgc ccggcctttt 40980 
tttgtttttg agacagggtc ttgccctgtc acccagactg gagtgcaatg gtgggctctt 41040 
ggctcactgc agcctccgcc tcccaggctc aagttgtgca cctccacacc tggctaactg 41100 
tattttatgt agagacagat ttcaccatgt tgcccaggct gggcttgaaa tggactcaag 41160 
cagtccaccc acctcagcct cccaaagtgc tgagattaca ggcgcgagcc accgcaccca 41220 
gcccatttta cctattctgc agttgacagt tcagtggcat tcagtcagtt cacgaggtaa 41280 
ccatcactgc cattcatctc cagactactt caccttctcg gcagatgtcc gaaactgtcc 41340 
gcattgaaca cactcctcat ctccctctga cagccaccat tctactttgt atctctctct 41400 
gccttctcta ggtacctcat gtaagtggaa ttataccaat atttgccctt gtgtgactgg 41460 
cttctttcat gtgacatggt gtcctcaagg ttcatctgtg ttatagcctg tgtcagaatt 41520 
tccttcctta aagcctgaat aataacccgt tgtaaaggct gggcgcggtg gctcacaccc 41580 
tctaatccca gcattttggg agtccgaggt gggcagatca cttgaggtca ggagtttgag 41640 
accagcctgg ccaacatagt gaaaccctgg ctctactaaa agtacaaaat tagctgggtg 41700 
tggtggcgcg cacctgtaat cccagttact caggaggctg aggcaggaga atcgcttgta 41760 
cccgggaggc agaggttgca atgaaccaag attgtgcctc tgcagtccag cctgggtaac 41820 
agagtgagac ttcctgtctc aaaaaaaaaa aaaatcatcg gatggatgga cggaccactt 41880 
cttgttattt atccatccac gggtgctagg tttcttccac ctttggttgt cgtgaataag 41940 
gccactatga acatttcctt ccgtggtgaa ggttttgtac tagtgaggaa aaggcgtgtt 42000 
tgtggtgttg cataggattc tggtaagaaa gtttgcacta accataagta tttgtactac 42060 
attaaaatga aagctcaggg gccgggcgcg gtggctcacg cctgtaatcc cagcactttg 42120 
ggaggccagg gcgggcggat catgaggtca ggagatcaag accatcctgg ccaacatggt 42180 
gaaaccccgt ctctactaaa aataccaaaa aactagccag gtgtggtggc gggcacctgt 42240 
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ttcaagtgat cctcttgcct cagcctcctg agtagctggg attacaggtg cacgccacca 51960 
tacccagcta atttttgtat ttttagtaga gacggggttt caccatgttg gtcaggctgg 52020 
tctcgaactc ctgacctcgt gatccgccca cctcggcctc ccaaagtgct gggattatag 52080 
gcgtgagcca ctgtgcctgg ccattaggtg tgttttatoa cccagcatca tgcagtttat 52140 
cttggtgaat gttctgtgta ctcttgaaaa gaatgtggat tctgctgttg ttgggtggag 52200 
tgttccagaa acatcaatta gatccagttg gttaatagtg ctcatcaggt tgtctctatc 52260 
cttccttcct gactgcctgc ttgagctgtc agttattgac aggggtgtgg agtctccaac 52320 
tctaatggtg gatttgttta tttctcctag tagttctatc tttttctctc cttctaccct 52380 
tgatcctctt ctccccctag ggcttcctgg tgttggtggt gggagagtgg ggtagtgaag 52440 
aacctggact ttagggccaa agaggccagg gttcaaatcc tggctctgtc acttcccagt 52500 
tgagtgaccc tggctggtgc ctgaatctct gtgagcctcc acttcctcct ctgtgaaatt 52560 
gagagcactt acctggcagg ctgtcatggg catcaagtaa cagggcactc cacctggacc 52620 
ctgacacgtg atgcacagga atgccagctg ctatgccatg ggtgtggcag tagtaataaa 52680 
gtgaccatct gtatcctcac cacagtgaag cctgtccagg gctttctctc ctatgccccc 52740 
atgcctccag gtggccttgg atcctgttgg ttctgtgctc tgctcagcga cctttctccc 52800 
gtgggagttc ctgggggttc agcttcatcc tacagacagc agcacacact ggctgtgcac 52860 
cctttttttt tttttttttt ttttttttga gatggagtct cgcttttttc gcgcaggctg 52920 
aagtgcagtg gtgtgatctt ggctcactgc aacctctacc tcctgggttc aagtgatttt 52980 
cctgcctcac cctcccaagt agctgggatt acaggctccc accaccacgc ccggctaatt 53040 
tttgtatttt cagtagagat ggtgtttcac catgttggcc aggatggtct tgaactcctg 53 100 
acctcaggtg atccgcccac ctcagcctcc caaagtgcag ggattacagg cgtgagccac 53160 
cacacccgga gtgccggttg tttttagcag tttgtcttgt tcctggagag actggctcct 53220 
gcccaggagc tcggggagta gggccgcggg gtgctgcctc acacctcgag tttggccgta 53280 
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attttgtgtg ccagcgcgtg gtgtgccagc gctatgcggg ggccaacggg cccttcccgc 60240 
acgagtatgt cagcgggacc ccgcacgtgc ccctcaattt catagccccg ggcggttccc 60300 
agcatggccc cttcacaggt aaggagcctg agatatggaa tgatctggag gaggcaggag 60360 
agtagtctgg gcagctttgg ggagtggagc agggatgtgc taccccaggc cctcttgcac 60420 
atgtggcaga cattgctaat cgatcacagc attcagcctt tcccactgag cctgtgcttg 60480 
gcatcagaat ccttcaacac agaggcctgc atggctgtag caacccaccc tttggcactg 60540 
taggtgtgga gaaagctcct tggacttgac cttcatattc tagtaggaca tgtgctgtgt 60600 
tgtccacaaa tcctcatgta ccctagaaat gaatgtgggg gcggctgggc tctctccaga 60660 
gctgaaggaa tcactctgta ccatacagca gctttgtctt gagtgcagct gggatttgtg 60720 
gctgagcagt tacaattcct acgtggccca ggcaccagga acgcaggctg tgtttgtaga 60780 
tggctgggca gccgcaccgc agagctgcac catgctggtt tgtatcacat gggtgaccat 60840 
ggtatgtcta agaaggtgga gtccctgtga ggtctgcagg tgcccccaca gctccaggcc 60900 
accttgagga ttgcctctgc ctgcccagcc ctgagttccc tctcccctgt cctgtcccac 60960 
tgtcacccca agccggcctc attgggagcc tgttggatgg cagggtatag atgtaacctg 61020 
attctctctg gggagcgggg ttatctggct tctcaagagc tcctaggagc ccacagtggt 61080 
ggcaccatca cagtcgcagc agcccccaga gaacgcggcc ctgtctgttc ctggcgtgct 61140 
ctgtgctgcc ccgcctgggt tccctgcccc agtcgcaggc cccttggagg aggtaccatg 61200 
tgtctcccgt ttcacagatg agccccgggg agctcactct agtagtggcc agagaggcct 61260 
gcggctcagg gagcggggca catttccaac aggacacacc gccctggtct gagtctcgtg 61320 
ggtagtggga gcagaggaga gcgccctatg tctgtggggc ggcttggctg agcctggaag 61380 
ccacctgacc tcccccgtcc cttccctgcc aggcatcgca tgcggaaagt ccatgatgag 61440 
ctccgtgagc ctgatggggg gccggggcgg ggtgcccctc tacgaccgga accacgtcac 61500 
aggggcctcg tccagcagct cgtccagcac gaaggccacg ctgtacccgc cggtgagggg 61560 
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tggactcgag cgatcctcct gcctcggcct ccccaggtgc tgggattaca ggcgtgagcc 63000 
accgtgcctg gcctggggta ttgtcttctt atggcacctg actgtggtgg gccctgggaa 63060 
ggaagtagca gaagagggtt cttcttggtt tcctggacag taactgagtg ttctggaggc 63 120 
cccagggcct ggctttgttt agggacaaag ggaactggta accagaagcc gagagtttaa 63 180 
acacccactg cccttcttcc ctgctcctgc tgctgcaacc cagcttaacc agccaggagt 63240 
gctaggaacc caagcagggc ccccgagcac acagcaggca gctcacgaat tctcttttcc 63300 
tgttctccct tgggagctgg gaggatctta atcaggcaat aagagatggc actgagcagc 63360 
cagctaattt tttaaatcac tttattgttt aaccatatga ctcacccact taaaaaaggg 63420 
tacagttcag tgggttttag tgtattcaca gatgtgtgca accctcacca cagttaattt 63480 
tagaacattt tcctgcccct aaaagaaact ctgcatgaag ccagctgttt ttaaattagc 63540 
aaagttattt tgcatccttt aaatatatgt tcatggtaca aaattcaaaa gatacagaag 63600 
agtctgcagt ccaaagagac tccgccccca tgacgccaag caggactccc tgggaggcat 63660 
ggcctcctgc agtgtgtttc ttctatgtcc ccccaggggt catctgtaca tatgcaagca 63720 
tacaagagcg tggactttgt tttccaagcc agaagataat tgtagattta tgtgcagttg 63780 
tgagaaagag cacagaccca tttatcctct gcctggtttc ccccagtgct gcctgccatc 63840 
ttgcatgact tccattccta tcataagcaa gacactgata acgattcttt caccttattc 63900 
agattgacat aagtgttttt tgtttgttct tgagacaaac ttcctctgtc acccagtggg 63960 
agtgcagtgg cacaatcaca gctcactgca gcctcaaact cctgggctca agcgattctc 64020 
ctgcctcagt cccctcaagt agctcagatg gcaggtgtgc accatcatgc caggctaatt 64080 
tttaaatttt ttgtggaggt gaggcctcac taaatttcct gggctagtct tgaactcctg 64140 
agctaaagtg atcctcctgc ctcagcctcc caaagtggta ggattacagg catgagccac 64200 
tgcgcctggg ctgacatatg tgttttcgta agcccgaaag atagcatctg aagagtcaac 64260 
attgagcctt gccttttgct gctaatgatg tataaaagct gctgttctga gcatttcgga 64320 
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ggggg ctt gg gctccaaact ctgactgtgt gtttgagtcc ggctgtggaa acctagccat 68520 
tgagatgccc cctcttggtg gctctgtcct cttaggatgg gacaagtctg tgaaggctgc 68580 
tgcagcaccc accgtagacc cctaatcgtg tgacgtcacc aggatggtcc gggctgctca 68640 
cttgccacag tggcctgttt gagcccggga agccaacggg gctgctcagc tggacaccag 68700 
ccccccgagc tgcccatgtt ggggtcacag gccccacctc cctggttggg gaggggcaac 68760 
tgagagtgtg gagaggtggg acccaggtgt gctggtctcc gcaggggctg gatcagagcc 68820 
tgggatgggc agggtgagcc tcctgacctt taacccagtg gtgtcaggca acgtggccca 68880 
cccgccagcc gcaccaggcc ccacccccgc aggtgaaggg gtgggatagg ctgggcctgg 68940 
gccaggacac ctctggacca cgcattcctc attgcttggg tccctggagc agcagggcct 69000 
cccgagtgtg gtgccgcctg ccacctagtg gccatttcca cgaactccca ggcctggctg 69060 
gggagccgga actgcagcct ccatttccac cccactccgg gtcgggccac ctccctgatg 69120 
cctcagtatt atatcaaact gtcacagtct gtcccacagc cttacagacc actgtctcca 69180 
gaatggtcac atccacactg ggcagcccag tctcgctagt tcctcgtccc acctcctgcc 69240 
tttgctcatg cccgtcctgc tctgggccca ccgcggacac atcttccccc cgcccgccgt 69300 
ctgacctcac agcagctggg ccccaagagg agtatcctgt cctgctgcac ttttctcaac 69360 
acccggtgtt ggctgcacct tcccacccat tgcaggcccc tctgtgacag gacgggggct 69420 
cctaaacaca ccacagttcc gagtctgaac tcacacagtg ggatgcggcg tttctgggcc 69480 
acagttgggt gcaggtagcc tctgggagga tgggaggtca ggagccatct tgcgagtcag 69540 
gttgcttgaa ctcaggatgg aagtgttccg ggcccattgg ttgctgtatt agcctgttct 69600 
cacgctgcta ataaagacat acccaagact gggtaattgt aaaggaaaga ggtttaacgg 69660 
actcacagtt ccacctgcct ggggtggcct cacaatcatg gtagaagaca aggaggagca 69720 
agtcacatct tacatggctt cagggaacag acagcatgag aaccaagcga aaggggtttc 69780 
cccttgtaaa accatcaagt ctagtgagat ttattcacta ccacga^aac agtatggggg 69840 
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ggctgctcag ctggacacca gccccccgag ctgcccatgt tggggtcaca ggccccacct 2940 
ccctggttgg ggaggggcaa ctgagagtgt ggagaggtgg gacccaggtg tgctggtctc 3000 
cgcaggggct ggatcagagc ctgggatggg cagggtgagc cicctgacct ttaacccagt 3060 
ggtgtcaggc aacgtggccc acccgccagc cgcaccaggc cccacccccg caggtgaagg 3120 
ggtgggatag gctgggcctg ggccaggaca cctctggacc acgcattcct cattgcttgg 3 180 
gtccctggag cagcagggcc tcccgagtgt ggtgccgcct gccacctagt ggccatttcc 3240 
acgaactccc aggcctggct ggggagccgg aactgcagcc tccatttcca ccccactccg 3300 
ggtcgggcca cctccctgat gcctcagtat tatatcaaac tgtcacagtc tgtcccacag 3360 
ccttacagac cactgtctcc agaatggtca catccacact gggcagccca gtctcgctag 3420 
ttcctcgtcc cacctcctgc ctttgctcat gcccgtcctg ctctgggccc accgcggaca 3480 
catcttcccc ccgcccgccg tctgacctca cagcagctgg gccccaagag gagtatcctg 3540 
tcctgctgca cttttctcaa cacccggtgt tggctgcacc ttcccaccca ttgcaggccc 3600 
ctctgtgaca ggacgggggc tcctaaacac accacagttc cgagtctgaa ctcacacagt 3660 
gggatgcggc gtttctgggc cacagttggg tgcaggtagc ctctgggagg atgggaggtc 3720 
aggagccatc ttgcgagtca ggttgcttga actcaggatg gaagtgttcc gggcccattg 3780 
gttgctgtat tagcctgttc tcacgctgct aataaagaca tacccaagac tgggtaattg . 3840 
taaaggaaag aggtttaacg gactcacagt tccacctgcc tggggtggcc tcacaatcat 3900 
ggtagaagac aaggaggagc aagtcacatc ttacatggct tcagggaaca gacagcatga 3960 . 
gaaccaagcg aaaggggttt ccccttgtaa aaccatcaag tctagtgaga tttattcact 4020 
accacgagaa cagtatgggg ggaaccaccc ccatgattca atcatctccc actgggtccc 4080 
tccxacagca cgtgggaatt atgggagtac aattcaagat gagatttggg tggggacaca 4140 
gccaaaccct atcggttgcc aacatttaca gtaacagtgt taggigaaca gttgtccagt 4200 
ctcctgtttt gtcggacact gtttctagca ccttccaggc agaatctcat gtatccttca 4260 
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ctttcgaaat gggtactatt tcatccccac ttttatcaat gagaaactaa agctcgaaga 4320 
ggtcaagtaa gttcctggcc aaggtcagct agcaggctct agaggcctcg ttctccttag 4380 
aggcagcctt gccagggccc aggcttggca ggctgcaggg caggtgcggg catgcccatg 4440 
gtagaggtgg gaccattgag gctcagagag ggtaagtgat gagccctggc gacacagcgg 4500 
ggtgggtcca gagtccggcc tgcatcttct ggagctggcc agtggacagg cctttcccgt 4560 
tcacagcccc ggggctgctg tgcccaccag ggcggatgtg cctaccgaat cccactcctc 4620 
tgtgtgtgtc cctttcaggc cctacatcat tcgaggaatg gcgcccccga cgacgccctg 4680 
cagcaccgac gtgtgtgaca gcgactacag cgccagccgc tggaaggcca gcaagtacta 4740 
cctggatttg aactcggact cagaccccta tccaccccca cccacgcccc acagccagta 4800 
cctgtcggcg gaggacagct gcccgccctc gcccgccacc gagaggagct acttccatct 4860 
cttcccgccc cctccgtccc cctgcacgga ctcatcctga cctcggccgg gccactctgg 4920 
cttctctgtg cccctgtaaa tagttttaaa tatgaacaaa gaaaaaaata tattttatga 4980 
tttaaaaaat aaatataatt gggattttaa aaacatgaga aatgtgaact gtgatggggt 5040 
gggcagggct gggagaactt tgtacagtgg agaaatattt ataaacttaa ttttgtaaaa 5100 
cagaactgcc attctttcgt gccctgtgtg catttgagtt gtgtgtcccc gtggagggaa 5160 
tgccgacccc cggaccacca tgagagtcct cctgcacccg ggcgtccctc tgtccggctc 5220 
ctgcagggaa gggctggggc cttgggcaga ggtggatatc tcccctggga tgcatccctg 5280 
agctgcaggc cgggccggct ttatgtgcgt gtggcctgtg ccgtcagaaa gggccctggg 5340 
cttcatcacg ctgttgctgt tcgtcttcct cagattctta gtcttttttt tttttttttt 5400 
ttttttgaga cggagtcttt ctctgtcatc caggctggag tgcagtggta caatctcagc 5460 
tcactgcaag ctccgactcc caggttcaag tgagtctcct gcctcagcct cccgagtagc 5520 
tgggactaca ggtgcgcgcc accacacccg cccagctaat ttttgtattt ttagtagaga 5580 
tggggtttca ccatgttggc caggatgatc tcgatctctt gacctcgtga tccgcccacc 5640 
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aaaaaacaag acccaaccat ctcttgcata caagaaacac actttaccta taaaaacaca 7080 
ctaggccagg tgtggtggct cacacctgta atcccagccc tttgggaggc ctgactggca 7140 
gatcacctga ggccaggagt ttcagaccag cttgaccgac atggcaaaac cccatctctc 7200 
ctaaaaatac aaaaaaacaa aaaaaagaaa aaggctggaa gtagtgatgt gtgcctgtag 7260 
ccccagctac ttgggaggct gaggcaggag aattgcttga atccgggaag tggaggttgc 7320 
agtgagccag gatggtgcca ctgcactcca gcctgggtga cagagcgaga ccctgtcata 7380 
aaaaaaaaaa gaaaagaaaa gaaaaacgag aaaaacaaac acaaaattag tagaagaaaa 7440 
gaaataataa agatcagaac aggccaggct catgggcaca gtggctcaac tcctacctgc 7500 
tcaggagttt gagaccagtc tggccaacat ggcaaaaccc catctctcct aaaaatatga 7560 
aaaaaaaaaa ataggctgga tgtggtgatg tgtgtgtgcc tgtagcccca gctacttggg 7620 
aggctgaggt gggagaatca cttgagccca ggaagtggag gctgcagcga gtcatgaatg 7680 
caccctgcac tctagctggg taactggagt gagattctgt ctcaaaaaag caaagaccag 7740 
agcagaaata aatgaaatgg aaatgaagga aacaatgcaa aatgatacaa aaagtttttt 7800 
cgaaaagata aacaaaatca acaaaccttt agccagatta agaaaaaaag agagaagacc 7860 
caaataaata aaatccgaga ttaaaaagga gacattacca ctgataccac agaaattcaa 7920 
aggatcatta gaggcaacta tgtgcaacta tatgctaatg aactggaaaa cctagaagaa 7980 
ctgggtaaat ttctagacac atacaaccta tcaagattga accatgaaga aatccaaaac 8040 
ctgaacaggc cgggcacggt ggcttacgcc tgtaatccca gcactttgga aggcctgaga 8100 
tcaggagttc gagaccagcc tggccaacat ggtgaaaccc catctctact gaaaaaatat 8160 
aaaaattagc cgggcgtggt ggcgggtgcc tctaatgtca gccactcggg aggctgaggc 8220 
aggaaaatca cttgaacctg ggaggcatag gttgcagcga gccgaggttg caccactgca 8280 
ctccagcctt ggcgacagag ccagactcca tctcaaaaaa attaaaataa caaaaacctg 8340 
aacagaccaa taacaagtaa tgcgatgaaa actgtaataa aatgtttccc aacaaagaaa 8400 
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gcccaggaac aaatggcttc actgctgaat tttaccaaac attttttttt ttttgagacg 8460 
gagtctcgct ctgtcgccca ggctggagtg cagtggtgta acctcggttc gctggtaact 8520 
tatgcctctc aggctgcaag tgattttcct gcttcaggcc ccccgagtgg ctggaaatta 8580 
gatggtactt gtcaaacaag gcctggctaa atttctatat ftccttcaag tagaagatgt 8640 
gcttccaaca aaggttgggt tacggctggc ttctgaaaat cttggatttc aaggctcccc 8700 
aaaag 



8705 



<210> 11 
<211> 66933 
<212> DNA 
<213> Homo sapiens 

<400> 11 



tataatcaag cgcgttccgt ccagtccggt gggaagattt tcgatatgct tcgtgatctg 60 
ctcaagaacg ttgatcttaa agggttcgag cctgatgtac gtattttgct taccaaatac 120 
agcaatagta atggctctca gtccccgtgg atggaggagc aaattcggga tgcctgggga 180 
agcatggttc taaaaaatgt tgtacgtgaa acggatgaag ttggtaaagg tcagatccgg 240 
atgagaactg tttttgaaca ggccattgat caacgctctt caactggtgc ctggagaaat 300 
gctctttcta tttgggaacc tgtctgcaat gaaattttcg atcgtctgat taaaccacgc 360 
tgggagatta gataatgaag cgtgcgcctg ttattccaaa acatacgctc aatactcaac 420 
cggttgaaga tacttcgtta tcgacaccag ctgccccgat ggtggattcg ttaattgcgc 480 
gcgtaggagt aatggctcgc ggtaatgcca ttactttgcc tgtatgtggt cgggatgtga 540 
agtttactct tgaagtgctc cg gggtgata ^ 
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taaataaaaa tttattaaaa cattcatcac agccagccta gtgggtgtcc catgtggctt 1 1700 
tgcctcgcat ttccctgata actaggatgc tgagcgtctt gtcccaggct tgccacacct 1 1760 
cagcactttg agatacgtcg cacagtcccc atttgcgaac gagaaatgag gtttagggaa 11820 
cagcagctgt gtcatgtcac acagcgagca gggggtctct gagccgtctg accccacagc 1 1880 
cgaccaagct ccaatcctta ccgcctccta gtgttgtgga tgtagcccag ggtgctccca 1 1940 
catttttcag atgagaacac cgaagctcaa aacaggagcg ttttgtccac attggataca 12000 
cgatgtctgt ggtttggtcc tgaagtcact ttatatctca gtggtccaga ctggagtagg 12060 
acagggggtt ctggggaatg gggaaggtgt ctcaggtgaa aggaaggaat tccagattct 12120 
ccatactgtc cttgggaagt tagaagactc agagggtctg gcaaagtcag acaaagcaag 12180 
agaaatgcag tcaggaggaa gcggagctgt ccaggaacag gggggtcgca ggagctcacc 12240 
cccaggaact acacttgctg gggccttcgt gtcacaatga cgtgagcact gcgtgttgat 12300 
tacccacttt tttttttttt ttgaggtgga gtctcgctct cttgcccagt ctggagtgca 12360 
gtggcacgat ctcggctcac tgcaagctct gcctcccggg ttcatgccat tctcctgcct 12420 
cagcctcccg cgtagctggg actacaggcg cctgccaccg cgcccggcta atttttgtat 12480 
ttttagtaga gatgggattt cactacatta gccaggatgg tctcgatctc ctgacctcat 12540 
gatccgcccg tctcggcctc ccaaagtgct gggattacag gcgtgagcca ccgcgcccgg 12600 
cccgatttcc cactttaaga atctgtctgt acatcctcaa agccctatac acagtgctgg 12660 
gttgctatag ggaatatgag gcttacaggc catggtgctg gacacacaga agggacggag 12720 
gtcaggaggt agaagggcgg agagagggaa caggcggagg tcacatcctt ggctttcaaa 12780 
atgggccagg gagagacacc ctctgagcat ggtaggacag gaaagcaaga ttggaacaca 12840 
ttgagagcaa ccgaggtggc tgggcgtggt ggcttacgcc tgtaatccca acactttgga 12900 
aagctgaggt gggtggattg cttgaggcca ggagttcaag accagcctgg ccaacatggt 12960 
gagaccccgt ctctactaaa tatacaaaaa ttagccaggc gtgatggtgc atacctgtaa 13020 
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tcccagctgc ttgggaggct gaggcaggag aattgcttaa acctgggagg cggaggttgc 13080 
agtgagccga gatcccgcca ctgcactcca gcctgggcca cagagtgaga ctccatctca 13140 
aaaaaaaaaa aaaaaaaaga taaaaagacc aaccgaggaa ttgaagtggg ggggcgtcac 13200 
agtagcagaa gggggatcgt ggagcaggcc accctgtggt catgcactgg aagctcatta 13260 
cctgacgatt tggagctcat cactgggggc ctaaggagaa tagatactga aggatgagga 13320 
gtgatggcgc ggggcacggg tgtctttggt ggccagaact tggggactgc tggggtgcct 13380 
cactgcaggc cttctcagcg ccctttatat gcttacacag gctgtttcta agagggggat 13440 
acattgcata agcgttttca gactacctca tcatgggtcc ctttctttac cctctgtggc 13500 
cctggtggcg cactctctgg gaaggtgcag gtggatgccc agacccgccc tgccatccac 13560 
ctgcacgtcc agagctgact tagcctcgag attgctgctg gcacctcctg ccccgggaca 13620 
cctcggatgt gcccgtggag atgctggctc tgtgttttct gctggagttt ggtgcgtctt 13680 
ttcctcctgc aagtggccac cgctcttggg tatgtcctca ggcttctgcg agtcatggct 13740 
gcttctcagg tccttgccca gcgccaggag caaaccctcc tggcactttg ttcaggggtg 13800 
gatgcgccag tgttcctgct gtggaccccc atctcacatg agggtcttgg gcctgcaggc 13860 
tcgttcagga aacacccgct gagtacgcag tgtgtgccag ctgtgtccca ggcaatggcg 13920 
gggacagtgg ctgctgctgg ggttgtggtg gcttctgggg actctgggga cagctgaggt 13980 
gcaaggagcc acggctcctt gaggatgcag ttggactcca ggtggaaggg atggttgggg 14040 
gaggtataaa tggggtcagg gaggagacac atttggaaca atgggaacat ttttaagatg 14100 
ctatgtcggg aggcaacaag gtggccaacc caggtgctga ggagcccaca ccagccctgg 14160 
acgtgttttg ccgctcacct ttgctgggga gtggtgggag agaggattcc gttccacgtg 14220 
gtggtgtgcg cagctgggct gtgtggagct gggcgctagg aggaaggtgc tttctgcggg 14280 
gctagccggg ctctgccttt gaacacaatc aggctccagg ttttcagcat ccagtgcatg 14340 
agaggacttc acgggcagct gtggctgatc ccttgatgaa ttgggagaag aacaaaggtc 14400 
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tatgaaatga ggtttcatgt agatggcatt agagacgccc acaacagatt tacagagtgg 14460 
agcggagacg gcggatgggt ctgggaggcc cctcctgctg gccttgactg tgacagctgt 14520 
cctgggaatc agcttccagg ccgccccagc agcctgactg acacacacag gggttttagc 14580 
cccatcctgc gaccagctgt tgccatcatc agtgacagct gggagtggcg gtggttccag 14640 
ccctgggcac cctccccacc tgctggggcc cacccagggc agtcctgaca cctacaggtt 14700 
gcttggagcc gcatccgagt cctgccccac cacgtgtgaa gcccgagtgg tcgtgggctg 14760 
aggtcccctg attgcatccc cacttccctt ctgcttcaca tagctgcctc ttctcaccgt 14820 
ttttccagcc tcctgggcta ggaattccag tgttgtgctg gctttgcccc aggacacctc 14880 
cttagccctc ttcctgagtc tagagccccg ggggttggaa gttctggccc ctgggacacc 14940 
tgcagccaca ctcagcttct cctgtgagcc tccagcatgt cccctcagga ccaagccctc 15000 
acgttcttgc ctccccgccc acctgggctc agccagggga aggcctggct gggagcgtct 15060 
cccctctgcc ctgcccttct cccctctacc ctgcccttct ctcctctgcc ccgccatggc 15120 
ttttatatcc tgtgccacaa gacatggctg tgtgtgaaag tggcagggtc tggcatctct 15 180 
gtgggtctct gaggcccacg ctccagtgcc actcttccca cccgctggcc gtgccctcat 15240 
gctggaggga cagcccagcc ctctcccgaa ccccagcccc atgtgcccag ctgcccccgg 15300 
ccctctcccc tggaagccgg ggtcactcca gccgtatgcc atggtgggga catcctgctt 15360 
ccttggcctt ccagggaagg tcctctttcc aaatggcgac acctggtccc tgcctggagg 15420 
ctggaagctg tggcccttgt atgcccctcc agggtctgtg cgctcggttg gcccgagttc 15480 
ccatcaccgt catcatcacc atcatcattg tcatttcgct tgtctgtgag ccggcctggt 15540 
ctcccagagc agagaccctc tgaggtccag cctgagttgg ggtctccgtg ctgacccctg 15600 
acggggactc aggacgtacc aggtctgggt caggagtgac ccccaaacct cgtgcccttt 15660 
gacaggcacc cctgactttt gctaagtggg tggaggtgac atcacttaca gcgggagtga 15720 
tgggacaggg tctgttggct gcactgtgct cccagggatc tggggagagg ctatatccct 15780 
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gggctttggc actgcagagc tgtgtgtgtt tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 15840 
tgtgtgtgtg tgtgtgtgtg tttgcgtgcg cgcacatgtg tataagatct ttttttatta 15900 
catgaagcaa gataactgtt gctgtttcct tttgggtttt gtgttcaaca gagtggggta 15960 
cttcttccct cagacaacag aactctcccc tttaaacacg tgctgtcaga gggtgggtct 16020 
tgggctcatg tctgtttgca cagccgagtc agaggaaaca cagggttctt cataaaaaca 16080 
ctgcacagca ggcgactgtc cagagtcagc ctgcaggacg gcagcagccc tgcccctcag 16140 
agcacagcta gggtgggctg ctttgggatc tcccgtcatt ccctcccagc tggcagccgg 16200 
cggccggccc attccttggt gtgctggtca ggggggcgtg cgcctgctct gctcaccctg 16260 
ggaatgggac agaagctggc agctcggaga ggacagggct ggacccttgg gtggcctctg 16320 
gctggaccat ctcattgtcc tcagacacag cctctcgggt ctagtttcat ttcctgaaaa 16380 
acaagtgcac agaactagag caggagtcga gagctacggc ccccgggcca gatccagccc 16440 
tgccacctgt tttcacacca tgctcaagct gagtgggttt tacatttttt aattacttga 16500 
aaaaaaaaaa gccaaaggag gtttcatgac ccatgaaaat tatatggaat tcaaaaaaaa 16560 
aaaattatat ggaattcaaa tttcagtgtc cataaataat ttcttgagac agggtctcgc 16620 
tctgtcaccc aggctggagt gcagtgctat ggcatggctc gctgtaccct tgacctccca 16680 
ggctcaagcg atcctcctgt ctcagcctcc tgagtagctg ggactacggg tgtgtgccac 16740 
caagcccggc taattttttt ttaattttag taaagacagg gtctttctat gttgcccagg 16800 
cttttctgga actccatctt ggcctcccaa agtgctggga ttacaggctc gagccacgga 16860 
gcccagcctg tttttgtttt ttcactgata aagttttgcc gggtgtggta gtgtgtgcct 16920 
ctagcgattt gggaggctga ggtgggagga tcgcttaagc ccaggagttt gaggctgggc 16980 
tcaagtgatc aggaggtgaa ctatgatcat gtcattgcat tccagcctgg gtgacagagc 17040 
aagaacctat ctcttaaaaa tatatattta aaaagtattg ggtgtggtgg ctcacgcctg 17100 
tggtcccagc tacttaggca tctgaggtgg gaggatggct tgagcccagg agtttgaggt 17160 
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tgcagcgagc caagatcgtg tcactacact ctagcctggg tgacagagcc cagaccctgc 17220 
ctctttaaaa aaaaaaacca aaaaacatgt attggaacac agccatgcct gttcagtcac 17280 
gtgctctcca tgctgctttc tgctccagag acccttatgg cctgaaagct gaaaatattt 17340 
tctatccttt acaaaaaagt ttgctgacct ctgtcctgga aaattcatct cccaagttct 17400 
cttccggcac tggcgttcct gggtgtccta aatttggccc ctgttatttc tgaactctgt 17460 
tttggctctg ttccctccca ggagccagga caggcacgtt ctctgcatct tgtcccctga 17520 
cgcccagagg cttggctcgg ctcaggcatt cttggaaata tctggctcca ggaaaggcag 17580 
aggcctcctg agtcagccca gagggaacct gccccaggtc tgggggaggc ctgacccagc 17640 
agagtggctt ttgccgatgg gttgggccgg tcaagatgtg ctgaaagttg tcctcagaag 17700 
gccactttgg gattccttcc tccagtatta gagcaactga gagctgctca ttgcaagcct 17760 
gatgttttcc cagttggccg ggtccaccgg gtgccctggg attctgggat ctgggtggaa 17820 
agtagggggc ttgggggagt gtcctgggtt ctggaatcca ggtggcaagt ggtgaggttc 17880 
agggagtggc ttctgagcca ccataggggt ctctgtggga ggctctgccc atccaggaga 17940 
ttccgcaggc cctgccggcc cagagccagc gtcttgcgct tgccgaggct acagccagcc 18000 
ccagccgggt ggaacagccc gtcgcctcct ctcactttgt tttggggcca cctgggagtg 18060 
tggagcaagg gtagagaggg aggaagtggc tgccggccgc tgcccagcac ccttgtttgc 18120 
cttgggccct ctgtgggctc ctttttattg ctcttcaatg aagccaggga aatggacttc 18180 
cttgcctcac ttcagttcaa catgtctgga agtttggtat taaaattaag aaagtgtgga 18240 
aatagagcaa gaagagaaaa atctctccaa gagataatag tgacctctga gctgggcgcg 18300 
gtggctcacg cctgtaaatc ccagtacttt gggaggctga ggcgggcaga tcacctgagg 18360 
tcgggagttt gtgaccggcc tgaccaagat ggagaaaccc cgtctctact aaaaataaat 18420 
aa?taa?taa ataaataaat acaaaattag ccaggcatgg tggcgcctgc ctataatccc 18480 
agctaaggca ggagaatcgc ttgaacctgg gaggcaaaggjtgcagtgag ccaagatcac 18540 
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gccattgcac tctagtctgg gcaacaagag tgaaactccg tctcaaaaaa aataaataaa 18600 
taaaaaataa aaatagtgac ctctggccag gtgtggcagc tcatacccgt aatcccagca 18660 
ctttggaagg aaggccgaga tgggcagatt gctttagcac aggagtttga gaccagcctg 18720 
gccaacatgg tggaacccca tctctacaaa aatagaataa aatttaagag gtaatagtga 18780 
ccttttggta gatcgaaacc tggattgctt tctttttcta aatgctgatt cttttctttg 18840 
tggtgtttgt gttctgtgcc gatgtccctc ccccagccct gttattgtga gtggaagaag 18900 
gggaaagggt tcgcccgcta ctgtgagccc ctcctctcac gctgggtgtc cttggagaag 18960 
cctgcacttc ttcattgtac gccagggctg ggtccctccc tggagtggtt ctgtgctgct 19020 
gggatggggc caacccctca gatgttttct gagtgtcaca cacaggtgtg tgcattcatg 19080 
gcctttgcgt gtcttcctgt tgtggaggca aaaatgtgaa gaaccctaga tgattttggg 19140 
accagggctc catcacctgc tgttcattgc acaccggagc atccaggcat gggtggagag 19200 
ctcagacttc caggcacggt cgcaggggct ggtctaacca tgttcccgcc cgcctgctcg 19260 
tcagaaccgc ctgttgggag ctgttatcat gataccatac ctgggccctg ggctatccga 19320 
ttctgactta attgctccag gttggggcca ggccgttgtt tgctgttttg ttgtttcttc 19380 
tgtgacgtta gccactgggc taatctgagc ccctcagtta caggtggaga aactgagacc 19440 
catgggggtg caaggacttg ccgaggaccc agagcccctt gggggcagag ctgaggcggg 19500 
gcctggcttt gggtcccaga gcttccagtc cccttcccgc tctcctaaca gctttttttt 19560 
ttgagacaag atctcaccct gtcacccagg ctggagtgca atggcatgat ctcggctcac 19620 
tgcaatcttc gctagctgcg ttccagcgat tctcctgcct cagcctcccg agcagctggg 19680 
attacaggtg tgtgccgcca tgcccagctc gttttttttt gtacttttag tagagatagg 19740 
gtttcaccat gttggccagg ctgatctcga actcctgacc tcaaatgatc cgcctgcctc 19800 
ggcctcccaa agtgctagga ttacaggctg ggatcacact gtgcctggcc ctagcagctt 19860 
tgtcctgtgc catccaacaa cagatgaccg aagtctttgt ttcttaacat gcattccatc 19920 
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tgccttacag ttttgccacc tgcaaaacag aggacttgtc gcttttctgg taagctggaa 19980 
atgtaatctg gtagcaggag gcctgtggaa gcttgccttt aatggccttg tgtctctttc 20040 
atcctgtcct gagagccgga gaacttggat gttgcaccta actcaacctt cctgttaaca 20100 
tacagttctg caggctcatg gatcatcaga accacgtcct atctcacgcg gctgtatgct 20160 
tccgttggtt caggtgtttt taccttgaca gtattttctc ctcggtggct tttgcggtgg 20220 
ttgcttttaa tcagcattga ctcttcaaga aaaatattta gctgctacat ctcagaggag 20280 
acagggtgga aagcatctga gacctgcagg ctcagactta gaaccagaag tgccctcaga 20340 
gttcatccgg ccctgaccca gcgggaaatg agttcacaga gaagcgggag aactttgccc 20400 
caggccctgc cgttgctcat aactgcccca ggtccttaca tttgctccag gtcctgcccc 20460 
aggccctgca gttgctcata actgccccag gtccttatat ttgctccagg tcctgcccca 20520 
ggtcctgcag ttgctctgtg tggtgggtgt gatctggagc cctccgccca ttgctgcacc 20580 
tggggcaggc attgctaatt gatcccagga ctccttcctg cggagcacgc cctggttctc 20640 
caggcagccg ctgcctgtca gcctgcagtg gttcgggaga ggacacctgc ttgcctggtc 20700 
tgttccaaat cttgcttctc atcccagcac aggtaggggg tgctatggga aagggatcct 20760 
cagttggccc tgtcactgct ctatcagctg gggacgtggc atcctagtga aaacatcatg 20820 
gccgggcgcg gtggctcacg cctggaatcc cagcactttg ggaggctgag gagggtggat 20880 
cacttgaggt cagaagttcg agaccagcct ggtcaacatg gtgaaaccca tctctactaa 20940 
aaatacaaaa attcgccagg tgtggtggcg ggtacctgta atccgagcta ctcgggaggc 21000 
tgaggcagga gaatcgcttg aacctgggag gtggagcttg cagtgagccg agatcttgcc 21060 
actgcactcc agcctgggca acagagtgag acgctgtctc aaaatctcaa acaaacaaac 21 120 
aaacaaaaaa caaacaaaca aagcgtcatt tatccagcac ccctggggaa ccatgctacc 21 180 
tggtgtttta tggtacctgg caaggtgcag gtgaagttgc tgctcttggg cattgaaccc 21240 
gtcttgtttg gggcagctca ggccccaggc agggtccggg ttggctctcg ttggtgtggc 21300 
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cctggcccat ccagacctat atttctgccg tcctgcaggt gatcaatgtt gatgggacga 21360 
agaggcggac cctcctggag gacaagctcc cgcacatttt cgggttcacg ctgctggggg 21420 
acttcatcta ctggactgac tggcagcgcc gcagcatcga gcgggtgcac aaggtcaagg 21480 
ccagccggga cgtcatcatt gaccagctgc ccgacctgat ggggctcaaa gctgtgaatg 21540 
tggccaaggt cgtcggtgag tccggggggt cccaagccat ggctcagcca tgcagacttg 21600 
catgaggagg aagtgacggg tccatgcctg ggcataagtg ttgagctcag gtgccccgac 21660 
rtggggaagg gcaggacagg aaaggtgaca gtatctggcc aaggacagat gggaagggac 21720 
caagggagct gattagggag tggttatgga ctaggaatgt cggtaacaat ggttagaaag 21780 
tgactaacat ttgttgagca cctgctgtgt gcccggccct ggccgggagc cttcgtgccc 21840 
acagtgaccc cgtctgcaaa tgtagttcct tgccctactc gcactgggga gcaggacgca 21900 
gagccgtgca tctcacaggt gccaagctca ggactccctc ctgggtctgc ctgggctggg 21960 
ctgtgcttgt tgcccctgtg gcccacgcat gtgcaccttc cacctgaaag ccaggatctt 22020 
caggacgctc cccgaggagg tcgttgtctg gcacaatgat ttgtctcttc ctgaaaaggt 22080 
gacagagtta cactggagag agcagcatcc aggtgcggca gggacaggcc tggggctcgc 22140 
gggcagggac tctgtgtcct gccggggtcc cacactgcac ctgcttgtca gaggcactca 22200 
gtcaatcttt gctgatgaag gatgagagga cagaggacgt gatgcttgct gctgcattgc 22260 
ctgcagtcct gggtgagatg cccgggttga ctctgctgcc cgtcgggtgg atgtgatgtc 22320 
agatccccgg ctttaaaata cgagggagct gggaattgag ggagcaggtt ggggcagaaa 22380 
gcacagcccc gtggaagcct ggagctgagg cagtgtgggc gacccctgga gcagtgagtg 22440 
cttccttcat ggccttcatc gcaccctgca gtcctcatgt aggggatgcc atccatgaat 22500 
ttagttttcc cagcctcctt taaaaacgcg ttcatgctgg ggccggggca gtgcagtggc 22560 
tcacatctga aatcccacca ctttgggagg ccgaggcggg tggatcatga ggtcaggaga 22620 
tcgagaccat cctggctaac aaggtgaaac cccgtctcta ctaaaaatac aaaaaattag 22680 
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ccgggtgcgg tggcgggcgc ctgtagtccc agctactcgg gaggctgagg caggagaatg 22740 
gcgtgaaccc gggaagcgga gcttgcagtg agccgagatt gcgccactgc agtccgcagt 22800 
ccggcctggg cgacagagcg agactccgtc tcaaaaaaaa aaaaaaaagt acaaaaaaaa 22860 
aaaaattagt ctgggtgtgg tatcacgcgc ctataatctc actactcgag aggctgaggc 22920 
ggagaattgc ttgaacccag gaggtagagg ttgtagtgag cccgtatcgt accactgccc 22980 
tccacctggg caatagagcg agactctgtc tcaaaaagaa aaaaaaaaaa agaacattta 23040 
tgccaggtgt ggtggctcat gcctgaaatc ccagaacttt ggaagactga ggcaggagga 23 100 
tcacttgagc ccagaaattt gagagtgtct tccctgggca acatagagag acctcatctc 23 160 
taccagaaaa aaaaaaatta gcccggcatg gtggcatatc cctgtggtcc cagctactta 23220 
gggggctgac gtggcaggat cacctgagtc tggaggcaga ggttgaagtg agctgagatc 23280 
atgccactgc actccagcct gggtgacaga cagagaccct gtctcaaaaa aaaaaaaaaa 23340 
aaaaagcatt tactatccac catggaaggt gagactgacc tgtgagtgat tgttcaaaga 23400 
acaaaaaata aaccccagag ataagacaaa agggtgcctc catgggggtg tgatttaaag 23460 
ctgagaaatt gggcttcttc cccctcccct ctcaccccgt ggtttgctaa aggagatggg 23520 
aaaaaggatt ctttttttgg ctgaaatatt taacactaaa ttaaagccaa ttttaacagc 23580 
actttggttg atgagtgaaa ttaacagact ggccaaaaat aaacgaacgg tctgtactat 23640 
gtgaaaaaga ggcagctttg gccatgctgg gccaatgtga gttttcaggg ttgctgggaa 23700 
tgtctgtgaa tcggaggaag ggcctagctg ggactctcag gagccaaggc cctgaggggc 23760 
aacttgcctg gtccctgccc tgaggcgttc actgctttct tcctgggcca gatcacaggc 23820 
ccggaggctg gaccactggg ctggcactct tgccgagctg ctccctgact tcctgaccat 23880 
gctcctttca gcagccttgc tgcactttag tttccttgaa tgaaaaatgg ggatgagaat 23940 
agctcctacc tccaaggtga atggagtgag ttcggacagg tgactccctg ggaccagtgc 24000 
ctggcgcctg acaaggtcca gtcagagccc gcactgctgt tactgatacc cttggctgta 24060 
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ccaggggaga acttggttgc cattgccagg tgttctccca ccacccccac tactgtccct 24120 
gtttgatgtg tggcgggaat aaagctgtgc acattggagc ttttggcaca tcctggcttt 24180 
caggtgaaag gtgcgtgtgt gtttgagggt ttagcctggc caacccagcc atgaggtcgg 24240 
acctgacctg ggggtgagtc ctgagctcgg cacccctgag ctgtgtggct cacggcagca 24300 
ttcattgtgt ggcttggccg cacccctttc cctgctgggc tgttgatgtt tagactggag 24360 
cctctgtgtt cgcttccagg aaccaacccg tgtgcggaca ggaacggggg gtgcagccac 24420 
ctgtgcttct tcacacccca cgcaacccgg tgtggctgcc ccatcggcct ggagctgctg 24480 
agtgacatga agacctgcat cgtgcctgag gccttcttgg tcttcaccag cagagccgcc 24540 
atccacagga tctccctcga gaccaataac aacgacgtgg ccatcccgct cacgggcgtc 24600 
aaggaggcct cagccctgga ctttgatgtg tccaacaacc acatctactg gacagacgtc 24660 
agcctgaagg tagcgtgggc cagaacgtgc acacaggcag cctttatggg aaaaccttgc 24720 
ctctgttcct gcctcaaagg cttcagacac ttttcttaaa gcactatcgt atttattgta 24780 
acgcagttca agctaatcaa atatgagcaa gcctatttaa aaaaaaaaaa gatgattata 24840 
atgagcaagt ccggtagaca cacataaggg cttttgtgaa atgcttgtgt gaatgtgaaa 24900 
tatttgttgt ccgttgagct tgacttcaga caccccaccc actcccttgt cggtgcccgt 24960 
ttgctcagca gactctttct tcatttatag tgcaaatgta aacatccagg acaaatacag 25020 
gaagactttt tttttttttt tttgagacag agtcttactc tgttgcccag gctggagtac 25080 
cgtagcgtga gctcagctca ctgcaacctc cgcctcccag gttcaagcga ttcttctgcc 25140 
tcagcctcct gagtagctgg gactacagac atgcaccacc acacccagct aatttttttt 25200 
atatttttag tagagacagg gtttcatcat gttggccagg ctggtcttga actcctgacc 25260 
tcaggtgatc tgcccgcctc ggcctcccaa agtgctgaga taacaggtgt gagccaccgt 25320 
tcccggcata ggaaaacttt ttgccttcta aagaagagtt tagcaaacta gtctgtgggc 25380 
tggccttctg attctgtaaa gaaagtttga ttggtggctg ggtgcggtgg ctcacacctg 25440 
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taatcccatc actttgggag gccgacgtgg gcatatcacc tgatgtcggg acttcgagac 25500 
cagcctcacc aacgtggaga aaccccgtct ctactaaaaa tacaaaaaaa aaattaaccg 25560 
ggcatggcgg cgcctgcctg taatcgcagc tactcaggag gctgaagcag gagaattgct 25620 
tgaacctggg aggcggaggt tgtggtgagc tgagatggca ccattgcact ccagcctggg 25680 
caacaaaagt gaaactccgt ctcagaaaaa aaaaagtttg attggtgtaa ccaaagcgca 25740 
tttgtttatg gattgtctgt ggcagctttt gttctgccga gatgagttgt gacagatctg 25800 
tatgggctct aaagcctaaa acatgtgcca tccgcccctt tacagaaaaa gtgtgctgac 25860 
ctctgttcta aagtattgga caactacaat gtttgctcat ttattattct atgatttgtt 25920 
ttctgctttt tgttgttgtt gttgttgttg agatagggtt tccctctgtc actcaggctg 25980 
gagtgcagtg gtgtaatctc agctcactgc agcctcgacc tcctgggctc tagtgatcct 26040 
ctcatctcag cctccctagt agctgggact acaggcacac accaccactc ctggctgatt 26100 
ULLLLLLLL L LLLLLLLLL ttgtggagac agggtttccg catgttgccc aggctggttt 26160 
caaactccta ggctcaaaca cccacctcag cctcccaaag tgctgggatt acaggcgtga 26220 
gccaccatgc ccagcctatt ctactgtttg tattacatag ctttaaaaga ttttttatga 26280 
ctttaagtca caagggttct ttgtagaaaa aaatatatat ataggaaagt ataaaaagaa 26340 
agtaaaaatt gtccataacc tctccagcca gagacgaccg ttgctgacac ctcagcatat 26400 
tgcctttaag tcttttttct ctaagatagc atttctcttc atcacagtca tatgctacgc 26460 
agaattctgt atcctgattt tttcacttga cattacaaca ggtatttgat ggcgctgtga 26520 
caaactcttt ggcacaatct tttaaatgta tgaaatactc cactgcacag atgtttgctt 26580 
ttaggcttaa ctgttctttt attttgcgtg tgctggttac agccgggcac agtggctcat 26640 
gcctgtaatc acaacacttt gagagggtga ggcaggagga tcacttgagc ccagaagttt 26700 
gagaccggcc tgggcaacat agtgagaccc catctctaca aaaaactttt ttaataagtc 26760 
gggcgtagtg gtgcatagct gtagtcccag ccaccaagga ggctgagttg ggaggattgc 26820 
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ttgagcccca ggaggttgat gctgcagtga cctgagatta ctccactgta ctccaacctg 26880 
agcgacagag caagacttgt ctggggaaaa aaaaaaaaaa aatatatata tatatatata 26940 
tatatataca tatatacata cacgcacaca cacataatat aaaaatatat atttataaat 27000 
atataatata taatataaaa atatatattt ataaataaaa tttataaatt atatttataa 27060 
gtaaatatat aatatataat ataaaaatat atattatata atatataata aaatatataa 27120 
tataaaaata tatatttata aataatatat aatacatact tataagtata tatttaaaat 27180 
atatgtaatg tatatttttt aatgtatgat atataatata catttataaa tacacattta 27240 
tattatttta tataaaatat atataaaatc tccaagttgc tttttccaaa aaggtgtctt 27300 
gctgcatttc aaacattcat ttaaaaactt gaatgctggt gatctggtcc agaatgtgtt 27360 
cagtagctgc tgccagtggc caagcatctc gggagatgtc tacaaaacac gctggttctg 27420 
gcctggcgtg gtggctcacg cctgtaatct cagcactttg ggaggctgag gcaggtggat 27480 
caactgaggt ctggatttcg agaccagcct tgccagcttg gtgaaacccc atctctacta 27540 
agaatacaaa aaaattagcc aggcgtggtg gcatgtgcct gtaatcccac ctacttggga 27600 
ggctaaggct ggagaatcgc ttgaacccag ggggcagagg ttgcagtgag ccgagatcgc 27660 
accattgcac tccaggctgg gcaagaagag cgaaactccg tctcaaaaaa aaaaaaaaag 27720 
atgctggttc ctaaaatgtg gcccttttcc tcctcacctg ctgccagacc atcagccgcg 27780 
ccttcatgaa cgggagctcg gtggagcacg tggtggagtt tggccttgac taccccgagg 27840 
gcatggccgt tgactggatg ggcaagaacc tctactgggc cgacactggg accaacagaa 27900 
tcgaagtggc gcggctggac gggcagttcc ggcaagtcct cgtgtggagg gacttggaca 27960 
acccgaggtc gctggccetg gatcccacca aggggtaagt gtttgcctgt cccgtgcgtc 28020 
cttgtgttca cctcgtatga gacagtgcgg gggtgccaac tgggcaaggt ggcaggctgt 28080 
ccgtgtggcc ctcagtgatt agagctgtac tgatgtcatt agccttgatg gtggccagga 28140 
ctggtagggc cctcagaggt catggagttc cttcgtggag cgggtgctga ggctgtatca 28200 
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ggcacagtgc tggctgcttt cacctgggcc gtctcaccga agtgtccatg gagcctgcgt 28260 
agggtgggta tctgtgtcga ttttacagat gcagaaacag gctcagagaa accgagtgac 28320 
ttccctaagg tcacataccc agttagagca gagctgggcc aggaagtgct gtctcaggct 28380 
cctgaccagg tctccttgct ttgcactctt gccaaaacca tgatccagaa ctgactttga 28440 
ggtccccgga cctcaggctc ctccgaaatg gcctcttgga ggctgctgag ccacagctta 28500 
ggacccacct cgagaggcaa atgtgctttg agctgccagg cgtcctgggg gccctgcctt 28560 
gggcacgggg ttcagacagg ccccagatgt gtggggcgtc tttctggact tgagttttct 28620 
tttctgtgtg gtggacacag tgctcacccc ttaaagcacc tgtgatgtgt gcagcagccc 28680 
aatccctgcc tgtcgcctgt tctgctaggg aaggaaggaa gacttcagga tggcaggaca 28740 
acagaaagag gtccaggttt tagagcaagg gcaggtcaaa cttagaaaat tctggaatga 28800 
ggatgtgcat ttcctcttct ggatctgcta aaagaagagg gaaggagggg ctgctggggg 28860 
aggagcccag agccgagttt acatccggat cccgcaaggc ctcccctgcc ctgaggtctt 28920 
gttttgtgat gtgcttgtgt ccatcctggt ttctgccgtg tccccaacat ccggccaagc 28980 
ttaggtggat gttccagcac acactcaccc tgtctgtgca cctgtttttg tgtccgtaag 29040 
tgggtattta ctcaccttac gagtgagcca ctgtgggaat tcagggaggt ggcgcagtga 29100 
ccacccctgg agggatatgt gtgtggcagg ggtcgagggt ctcgcccttc cctgcttcct 29160 
gcgcgtggct ttctccagga cggggagggc tgagctgaag aggtggggac agttgcgtcc 29220 
ccccgccacc cactgtcctg cggtgagagc agactcactg agcctgccct tctcccttgt 29280 
gccttccagc tacatctact ggaccgagtg gggcggcaag ccgaggatcg tgcgggcctt 29340 
catggacggg accaactgca tgacgctggt ggacaaggtg ggccgggcca acgacctcac 29400 
cattgactac gctgaccagc gcctctactg gaccgacctg gacaccaaca tgatcgagtc 29460 
gtccaacatg ctgggtgagg gccgggctgg ggccttctgg tcatggaggg cggggcagcc 29520 
gggcgttggc cacctcccag cctcgccgca cgtaccctgt ggcctgcaag ttccccaacc 29580 
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tggcaggagc tgtggccaca cccacgactg cccagcagcc tcaccctctg ctgtgggagt 29640 
tgtccccgtc cacccctggg tgcctttgct gcagttatgt cgggagaggc tctggtgaca 29700 
gctgtttcct gtgcacctgc tgggcactag gtcccagcta atccctgtgc caggactcta 29760 
atttcaccct aacacacatg gtggttttca ttgctgggga agctgaggcc tgagcacatg 29820 
acttgcctta ggtcacatag ctggtgagtt caggatcccc cagagatacc agggccagca 29880 
ctcgatcccc acccagccct gaaccccacc atgtgctggg attgtgctgg gagtgtccac 29940 
acgcctggga ccccagggct ggtgctctca tctccttttt ccagatcatg agaatgaggc 30000 
tcagggaagt ttgaaaaaaa cctatcccaa gtcacacagc aacaggagca ggatttgaac 30060 
ccagaaaagg ggaccgcaca ctctgttctg ctagagtagt tagctgtcct gggtgatatg 30120 
gcaggtgaca ggggcaactg tgcttaacaa aggaaccccc atcccccctg ccaagttggg 30180 
agactagaag gtcaggggca gaagctctga agggccaggt gcagtggctg acacctctaa 30240 
tcccagcact ttgtgaggcc aaggcgggca gatgatttga gcccaggagt tcaagatcag 30300 
cctgggtaat gtagtgagac gccatctcta caaaaaaatt ttttaaaaat tagctgggca 30360 
tggtggttca tgcctgtagt ccaagctact tgggaggctc aggtgggagg attgcttgag 30420 
cccaggaggt tgaggttgtg gtgagctgtg atcatgccac tgcactccag cctgggcaat 30480 
agagtgagac cgtctccaaa aaaaaaaaaa gaagaagaaa aagaagctct gaggctccaa 30540 
gtccccaggc accccttggc ttgagggcag acaagggagg agagggtcac ctgggcagcc 30600 
ctgacttttg tcccctggca aagggacctt cagtgacctt ggccctagga gagcctctga 30660 
gcacgtcagc catgtcgaac cgctcaggaa gggcagcaag aatttggctt ctgacctctg 30720 
cctctcctac tcgccatctg cactgggtgt ggttgtgccc attttacaga tgaggaggct 30780 
ggggcatcga ccagctgaat gccttgtccc aggtactgcg taggcagagc tggcagttga 30840 
accccgtgtc ctggttgtcg ctgggggtgg gctgcaccct gacttgtgag gccagtagca 30900 
aggtttgcac gtgacttcgt gaccgtcacc cagctctgca gcacatcccg tgacccagct 30960 



WO 01/92891 PCT/US01/16946 

199 

tggcaggagc tgtggccaca cccacgactg cccagcagcc tcaccctctg ctgtgggagt 29640 
tgtccccgtc cacccctggg tgcctttgct gcagttatgt cgggagaggc tctggtgaca 29700 
gctgtttcct gtgcacctgc tgggcactag gtcccagcta atccctgtgc caggactcta 29760 
atttcaccct aacacacatg gtggttttca ttgctgggga agctgaggcc tgagcacatg 29820 
acttgcctta ggtcacatag ctggtgagtt caggatcccc cagagatacc agggccagca 29880 
ctcgatcccc acccagccct gaaccccacc atgtgctggg attgtgctgg gagtgtccac 29940 
acgcctggga ccccagggct ggtgctctca tctccttttt ccagatcatg agaatgaggc 30000 
tcagggaagt ttgaaaaaaa cctatcccaa gtcacacagc aacaggagca ggatttgaac 30060 
ccagaaaagg ggaccgcaca ctctgttctg ctagagtagt tagctgtcct gggtgatatg 30120 
gcaggtgaca ggggcaactg tgcttaacaa aggaaccccc atcccccctg ccaagttggg 30180 
agactagaag gtcaggggca gaagctctga agggccaggt gcagtggctg acacctctaa 30240 
tcccagcact ttgtgaggcc aaggcgggca gatgatttga gcccaggagt tcaagatcag 30300 
cctgggtaat gtagtgagac gccatctcta caaaaaaatt ttttaaaaat tagctgggca 30360 
tggtggttca tgcctgtagt ccaagctact tgggaggctc aggtgggagg attgcttgag 30420 
cccaggaggt tgaggttgtg gtgagctgtg atcatgccac tgcactccag cctgggcaat 30480 
agagtgagac cgtctccaaa aaaaaaaaaa gaagaagaaa aagaagctct gaggctccaa 30540 
gtccccaggc accccttggc ttgagggcag acaagggagg agagggtcac ctgggcagcc 30600 
ctgacttttg tcccctggca aagggacctt cagtgacctt ggccctagga gagcctctga 30660 
gcacgtcagc catgtcgaac cgctcaggaa gggcagcaag aatttggctt ctgacctctg 30720 
cctctcctac tcgccatctg cactgggtgt ggttgtgccc attttacaga tgaggaggct 30780 
ggggcatcga ccagctgaat gccttgtccc aggtactgcg taggcagagc tggcagttga 30840 
accccgtgtc ctggttgtcg ctgggggtgg gctgcaccct gacttgtgag gccagtagca 30900 
aggtttgcac gtgacttcgt gaccgtcacc cagctctgca gcacatcccg tgacccagct 30960 
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catccaggcc gcatgcaaac ctgttgccag gcgagaaacc agtcaccgca cagctgtggt 31020 
tgcctgaaat gattaagctc attaatcacc ccgg agtg ag gacagactca gatgaaaacc 31080 
agcaaaagcc ctggaaactc atgtgaccct gccaatgagg gcggccatgt gcattgcagc 31140 
ctggccgtca ctcctcggta cgtgttttgg acttaaacgc tccggatgtt tactgagtgc 31200 
ttgattaata acatggaagg cctggtctca ttgctgtggg agtgaaggat gcacagccag 3 1260 
gcctgacatg atgagaacaa gaacctggag tctcgctgcc tgggtggtaa tcctggccct 31320 
gccacttagc aactgtgtga ctgtagccag gtcacttaat tttgctagat cctgcctgcg 31380 
cttcagtggatcttgctggtwccaaggtggccaaacactttaaggcattcatgtggtc 31440 
gctaggctgcagggttgaaccctggctcaccccgcagggcgccgtgtgctctgtggc^ 31500 
gctgtgcctt tgctgacacc gtgcccgtgt gtgttcatgc aggtcaggag cgggtcgtga 31560 
ttgccgacga tctcccgcac ccgttcggtc tgacgcagta cagcgattat atctactgga 3 1620 
cagactggaa tctgcacagc attgagcggg ccgacaagac tagcggccgg aaccgcaccc 31680 
tcatccaggg ccacctggac ttcgtgatgg acatcctggt gttccactcc tcccgccagg 31740 
atggcctcaa tgactgtatg cacaacaacg ggcagtgtgg gcagctgtgc cttgccatcc 31800 
ccggcggcca ccgctgcggc tgcgcctcac actacaccct ggaccccagc agccgcaact 31860 
gcagccgtaa gtgcctcatg gtcccccgca cctcactccc tcgttagatc aggctggttc 3 1920 
tgggagctga cgctgaaagg agcttctcat ctggggttcc tgggtgtaca tagatggttg 3 1980 
ggtaggttgt gcactgcaca agctgcatga tgctacctgg gggtccaggt ccaggctgga 32040 
tggacttgtt gcttcatcag gacatagata aatggccaaa actcctcagc tggaaggtcc 32100 
tgggcaggat ctttgggtgt gaaaaccagt cacaggggaa gggtgcttgc tcatactgcc 32160 
agcacagtgctgagtgcmccatagcgctcgtttactcctcaagcctg^ 32220 
gtagcatggt cccatttcac gtacaaggaa cccgatgcac agagaggtgt ggcaacccat 32280 
ccaaggccat acaactgggg tgggttg a g C cggggttgac tgtggcaggc tggctcaaga 32340 
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gtccctgctc ctgaaccctt gccaggcagc ctggcatcag ctcggggaat ttttgccctg 32400 
acccttggaa gcaagtgggc ctctttgttc tcatgtcagt gatgagaaga gtgactttcc 32460 
tatggcccct ctggagtaca ggtgtttcct gttggcgggc tcttccccca tgacatcagc 32520 
agcgagctgg ttatgattcc ctacgcagaa cttgatagtt tataaagctc tttgtcatcc 32580 
aggccccgtt ggagtctcac gcagacctgg tcgcaggcgg ggctggtctt gcctgtccca 32640 
gctgcatgga tggggaactt gaggcttgca aaggttaagg ggctgttcga ggcccaggct 32700 
ggcaggagat gggcctgggc cagagtctgg gacttcccat gcctgggctg tctttggtcc 32760 
tgttgctcac catccctccc tggggccatg accttagaga gccaaatgga ggtgcaggta 32820 
acccacggca aggaggggtt gccatgactc agagtccccg tcctgtggcc ggcagtacct 32880 
ggtgcaacga cttggatttc agaccagcca ctgtagcccg ctgacggtgc gctcgaagtg 32940 
ccacagcttc tgaagccagg caggactcag gccaggagac tctgttagct gttgagaggg 33000 
agaggccaac ggatgttctg gttctgctag agagctggtt cttcggatcc tggtaccagt 33060 
gcactgagag gaggcccagc ttgattctgg ggctgccttg tggtggcatg tgctgctcac 33 120 
tgacaccctc gaggagtgtc ttctctcggg cttgttgact gtgcccggtt ttccgcagtt 33 180 
cactggtgca cacataggca catagcaaac cgcacacaca gtcgtgggta tgagtttcac 33240 
tacattccac caccagtgtt cactaccatt acctgccttc cgtcttaagt gttcatcatt 33300 
taaaaataaa tttattgggc tggacgcggt ggctcatgac tgttatccca gcactttggg 33360 
aggctgaggc gggcagatca cctgaggtca ggagttcaag accagcctgg ccaatatggt 33420 
gaaactccat ctctactaaa aatacaaaat tagctgggca tggtggggca tgcctataat 33480 
cccagctact caggaggctg aggcaggaga atggcgtgaa cccgagaggc agagcttaca 33540 
gtgagcccag atagcaccac tgcagtccag cgtgggcaac agtgcgagac tccatctcaa 33600 
aaaaaaaata aataaataaa agaaaaataa atttatgatc tatttcaaaa ataacacatg 33660 
tactttgaaa cagcagagac acatatgaca cggagaatga aattccccat agcgcacccc 33720 
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caagagacag ccctggtccc cccgtctttc ccgtggacct ccagcggggc agatgctgag 33780 
ccgcctgttg tcgagtggcg tgctatcccg tcctccagct cctctgtggc ttacagacac 33840 
ccacctgcag ccctgtcttt gcctcctcta gcgcccacca ccttcttgct gttcagccag 33900 
aaatctgcca tcagtcggat gatcccggac gaccagcaca gcccggatct catcctgccc 33960 
ctgcatggac tgaggaacgt caaagccatc gactatgacc cactggacaa gttcatctac 34020 
tgggtggatg ggcgccagaa catcaagcga gccaaggacg acgggaccca ggcaggtgcc 34080 
ctgtgggaag ggtgcggggt gtgcttccca aggcgctcct cttgctggtt tccaggctgc 34140 
tgcccctgtc cttagcagag ggaggaaaca gaggatggct ctgggtgaat gatgacttgg 34200 
gcttcgatta tgtagtcaca gggtatgacc ctgagatgcg tggaaccccg agactgtgat 34260 - 
tatatgtaga aactgggttt ccccgttgtt taagtagtca tggtggggtc agaccccaca 34320 
ggacttttgt cttttcaaga aagaaaatgg tcgtgtgtca tgcaggggta gttggtactg 34380 
gttaatccag gtttatcctt tattttgtgg gaactgtaca gtcatttctg ctacaatgct 34440 
gtatatgctc ttctgaaaga cacctatgca aaatcgcaca gtaaaaatga cacaactcat 34500 
agggaaagcg gggccagggc acagccctca aaatctccat caatgacatg taagaaaaga 34560 
gaggaacctg ggaaatagca aagtgccttt tgcacattaa atggttagct atatcccaca 34620 
atactgtgca ttcgtaaacg ttaatgctgc aataaatacg gcacttcacc ttgggaagat 34680 
ctggagttgg cttatgagtg tggaagggtg tagcgcatga gtttttgtga aacactggaa 34740 
ggaggattgt gggaaatcaa atggaaagtt ctcaccccag gcgtggagaa gagtgggtca 34800 
tggccccagc agtgagccca gggaggtcag agacggaggt gtgtgtgtgg gtgtgaccct 34860 
gcgcagttcc ctgccggctg tagttttttg cattcgctta atgtttctcg tggaggaaat 34920 
tgtgcatgag caaatgtgaa accgtgctgt gctcaaattg tcctaataca tcattgcatt 34980 
ggaacagatt ggcttttttt tttttttttt tttttttttt tttttgagat ggagtctcac 35040 
tctgtcacca gcctggagtg cagtggcatg atcttggctc actgcaacct ttgcctccta 35100 
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tgttcaagtg attttcctgc ctcagcctcc tgagtaactg ggattacagg catgagccac 35160 
cgcggccggc cagatttgca tttttgaaac aactgctagg ctgggcgcgg tggctcacac 35220 
ctgtaatccc agcactgtgg gaggccgagg caggtggatc acctgaggtc aggggttcga 35280 
gaccagcctg gccaacatgg tgaaaccccg tctctactga atatacaaaa atcagctggg 35340 
tgtggtggcg ggtgcctgta atcccagcta ctcaggaggc tgaggcagga gaattgcttg 35400 
aacccaggag gcagaggttg cggtgagccg agatcacacc attgcactcc agcctgggca 35460 
acaagagcaa aactccatct caaaaaataa aaaatagaaa aacaagtgct gtagcggaag 35520 
tgagcacttt gcggagtcag gcttgtgtgg cctgttccac aaatgatgtg ctcacggtgg 35580 
cctcaggccc acctggagtc tgcagcatgg ggcacaacag gttcattagt gtagaattcc 35640 
aggacaggcc tggctcctaa gcagccttct tttacaaaaa ctgcagagcc cgcctgtatc 35700 
ctagcacttt gggaggccga agtgggtgga tcacgaggtc aggagttcaa gaccagcctg 35760 
gccaacatgg tgaaacccca tctctactaa atatacgaaa attagctggg tgtggtggca 35820 
cgcgcctgta gtcccagcta ctcgggaggc tgaggcagaa ttgcttgaac ctgggaggtg 35880 
gaggttgcag ggatctgaga ccatgtcatt gcactccagc ctgggcaaca gagcgagacg 35940 
ccatctcaaa aaaaaaaaac ctacagagcc acacggcctc tttctccacc gagtgttggt 36000 
gtgggagctt gtgttattgt ggtgaaatct tggtactttc ttgaggcaga gagaggctga 36060 
gcgcctggag agactttcac atgggtcgcc atgtccgccg tcggtttcgc tgttgtgctc 36120 
cccatctgaa ggctggtgcc gtccagacag gctggacgcc cctttccacc agatccttcc 36180 
tcccgcagca gtttctagtt acgttgtact gtgaggtctg tgtccttggt tgatggcaaa 36240 
agtcagccga attgaaattc agagccatgc ctggctccct ggagcttctc tcctgggcag 36300 
ctgtgatcat tgcctctgct gtggtgtggg tggtggaaat ggattccttt catcttgctt 36360 
gctacaggtg actgtcacgt ggagtccttt ggagagaggg acgtgttaat tgatggatgt 36420 
ggctcccatg ctgagaaagc tcctgggcgt acattgcctt agagtttcat tggagctgcg 36480 
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ttcttttatg gtgtctgcta ggcagaagtg atgaagactt ggaagaaaac ccagaaggtt 36540 
ttccacttaa tttggaaaat gtgcttttcc cctcctgtgt cttttgctaa ggtccagcct 36600 
cctgcagcct ccccgctctg tggactctgg ctttgattct ttattaggag tccccctgct 36660 
cccccaaaag atggtgtcta aattatcatc caattggccg aggttttgtt ttctattaat 36720 
tgtttttatt ttttattgtg gtaaatttat ataacataaa atttgccatt ttaattgttt 36780 
tgttattgtt gtttttgaga cagggtctca ccccagtgcc caggctggag tgcagtggtg 36840 
cgatcatggc tcactgcagc ctcagcctcc agggctccag tgatcctctc acctcagcct 36900 
ctctagtagc cgggactaca ggcatacact accacatctg gctgattttt tgtatttttt 36960 
ttttattgta gagacccgct atgttgccca ggctggtctc aactcctgga ctcaagccat 37020 
cctcccacct caccctccca aagtgctggg attacaggca tgagccacaa cacccagcca 37080 
ttttaatttt tttttttttt tttgagatgg agtctcactc tatcgcccag gctggagtgc 37140 
agtggcgtgg tatcaactca ctgcaacctc tgcctcccag gttcaagcga ctctcctgcc 37200 
tcagcctcct cccgagtagc tgggattaca ggtgcccatc actatgcctg gctaattttt 37260 
gtatttttta gcagagacgg ggtttcacca tgttggccag gctggtcttg aactcctaac 37320 
ctggtgatcc gcccgcctcg gcctcccaaa atgctgagat tacaggtgtg agccaccgtg 37380 
cccggccttt ttttgttttt gagacagggt cttgccctgt cacccagact ggagtgcaat 37440 
ggtgggctct tggctcactg cagcctccgc ctcccaggct caagttgtgc acctccacac 37500 
ctggctaact gtattttatg tagagacaga tttcaccatg ttgcccaggc tgggcttgaa 37560 
atggactcaa gcagtccacc cacctcagcc tcccaaagtg ctgagattac aggcgcgagc 37620 
caccgcaccc agcccatttt acctattctg cagttgacag ttcagtggca ttcagtcagt 37680 
tcacgaggta accatcactg ccattcatct ccagactact tcaccttctc ggcagatgtc 37740 
cgaaactgtc cgcattgaac acactcctca tctccctctg acagccacca ttctactttg 37800 
tatctctctc tgccttctct aggtacctca tgtaagtgga attataccaa tatttgccct 37860 
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tgtgtgactg gcttctttca tgtgacatgg tgtcctcaag gttcatctgt gttatagcct 37920 
gtgtcagaat ttccttcctt aaagcctgaa taataacccg ttgtaaaggc tgggcgcggt 37980 
ggctcacacc ctctaatccc agcattttgg gagtccgagg tgggcagatc acttgaggtc 38040 
aggagtttga gaccagcctg gccaacatag tgaaaccctg gctctactaa aagtacaaaa 38100 
ttagctgggt gtggtggcgc gcacctgtaa tcccagttac tcaggaggct gaggcaggag 38160 
aatcgcttgt acccgggagg cagaggttgc agtgaaccaa gattgtgcct ctgcagtcca 38220 
gcctgggtaa cagagtgaga cttcctgtct caaaaaaaaa aaaaatcatc ggatggatgg 38280 
acggaccact tcttgttatt tatccatcca cgggtgctag gtttcttcca cctttggttg 38340 
tcgtgaataa ggccactatg aacatttcct tccgtggtga aggttttgta ctagtgagga 38400 
aaaggcgtgt ttgtggtgtt gcataggatt ctggtaagaa agtttgcact aaccataagt 38460 
atttgtacta cattaaaatg aaagctcagg ggccgggcgc ggtggctcac gcctgtaatc 38520 
ccagcacttt gggaggccag ggcgggcgga tcatgaggtc aggagatcaa gaccatcctg 38580 
gccaacatgg tgaaaccccg tctctactaa aaataccaaa aaactagcca ggtgtggtgg 38640 
cgggcacctg tagtcccagc tacttgggag gctgaggcag gagaatggcg tgaacccggg 38700 
aggcggagct tgcggtgagc cgagatcgct tcactgcact cgagcctggg caacagagca 38760 
agactccgtc tcacgcaaaa ctctgtctca cgcaagactc cgtctcaaaa aaaaaaagag 38820 
ttcagggttt atgaaactgg ccagccgcgt aaagtttgct gtgttgtttt tgtgcccggg 38880 
aggagtgtgg ccagggtgtc acgtcacaca gtacacgttt ctcagatggt ggttctccag 38940 
actgctgtcc caaagtctgt ttttgcatct ggttcccaca gacccaccct ccacggtgag 39000 
cctgattttg gccagggtag ctggaatctt gcttgtcttt cagcccggca gctgtaccag 39060 
tccagggtcc acagctagtg gcttttagga aggaatttgt tcagttggct ttgacacatg 39120 
gccccctagg gtccacagct ctgtagtgat gtggatgttg ttatctacaa agacacatga 39180 
tccttcgtgt ccagatgaaa gtgatgatgt ctttgcagct gcccagcaag gctgtgtgtg 39240 
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tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tggtgtgtgt gtggtgtgtg tgtgtgtatg 3.9300 
ggggagggag gcaccctttc catctggggg tgtgtgtgtg tggggtgtgt gtgtgtgtgt 39360 
gcgcgtgtgt gtggtgtgtg gtgtgtgtgt gtgtatgggg gaggcaccct ttccatctgg 39420 
gtccaagaga ctgggcctgg ggaagacgct tctttttatc tacttagaga ctttgtttta 39480 
tttgtatttt tttgagacag ggtctcactc tgtcacccag gctggggtat ggtgatatga 39540 
gcatagctca ctgcagcctc ggcctcccag gctgaagcga tcctcccacc tcagccttct 39600 
gaatagctgg gactgtaggc gtgcgtcacc atactgagct attgtttttt ttgtttggtt 39660 
ggtttaattt tttttgatac agatggagtc ttgctatgtt gcccagacta gtctcaaact 39720 
cctgaactca agtgattctc ccacctcagt -ttcccgacat tctgggatca caggtgtgag 39780 
ccactgctgt ctccctgttt tattaactgc tgaaagacct agataaagaa agtctgaaaa 39840 
gacttactat cagagcacca tcctaagatg attccctctg actcaatgga gagggagggg 39900 
agcttttcct tcaggcctgg gtggcaggag cccaggtgct ccaggcccca tttgccccag 39960 
gccaaatcac tcgggaactt ggatgcagct gtctttcagg gtaacccaaa ggaaccagat 40020 
ccccgcaggc agtaggcttc tgggctgtcc tctcctccta cgtcagctca gtaagagccc 40080 
ttcgaaggga tgctgtgtcg gaggccccaa aagcccaggc tcatccctga gatgcacagg 40140 
gtgggctggg cttaggcagc gctcgagcat ctcctggacg gtgaccccag agagtgtgga 40200 
gacggagagt ccttgagagt cactgagaga cgtggctgcc ctgccttccc aagaggggct . 40260 
ctgagtcatt ccccacactc acctgcccct acccaccctc acctggcccc cagcctcacc 40320 
tacccccaca tctgtaccga tccctttacc cgcaccttcc ctacccaccc tcacctcccc 40380 
tgtaccttca cctcccccac tcacccgccc ctgcaccctc acctgtcccc caccttcacc 40440 
taacccccac cctcacctgc cctcccctca cctggcctcc ttccgttggg gaaggggttg 40500 
taaggggcgg cccccaaact gtctgtcctg gtgccctgca gagaaaacag tacgtgaggg 40560 
ccgcagtcca aaagcttgag tcctggaagg tggaggagac agggatgtgt tgggaagggc 40620 
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cccatggtct tggatccctt ctcgactgtc aatggggcct tcatgggagc gccagtctag 40680 
tgatgcacag ctgggtgccc ggcgggtggc tgaggaggcc taaagtccga ggcggcaaga 40740 
gctcttccag aggctgttgt cctaatcgct ctggcatact caggcgggca cgtagttagg 40800 
agctgattgg agaggagaga cccccacacc aatactggga tttgactttc aggctaaact 40860 
tgagaagtgt ggcctctgct gtcctgccag agctctccag ccagtgccca gggctctcca 40920 
gccagtgccc gggggtctcc accagtgccc gggggtctcc gccagtgcca ggggtctccg 40980 
ccagtgccca ggggtctccg ccagtgctca ggagtcttgg tttctttgtc ttacagccct 4 1040 
ttgttttgac ctctctgagc caaggccaaa acccagacag gcagccccac gacctcagca 41100 
tcgacatcta cagccggaca ctgttctgga cgtgcgaggc caccaatacc atcaacgtcc 41 160 
acaggctgag cggggaagcc atgggggtgg tgctgcgtgg ggaccgcgac aagcccaggg 41220 
ccatcgtcgt caacgcggag cgagggtagg aggccaacgg gtgggtgggg gtgctgcccg 41280 
tccaggcgtg cccgccgtgt cttatgccga atgccagcct ctcacaggct ggggagactt 41340 
tccacctggg gatccaatgg gtggctttcc agggtcccaa aagcaaacac aggtttttca 41400 
cagcccgtcc gggaaagcag aaagccccaa ggggctggaa ggggaaaggg ggagctctgc 41460 
tgagaggtta caaggcagcg ctggccgacg ggagttgcag ttgataggtt ttgtatcatc 41520 
cttgttaaac ttgaaccctg tgcagaaatc ccttccacgg catgggggct gcctgttgac 41580 
tcgctcctgt tccaccacag ggagctcctg ggcttcttcc tcccagaggc ccccgacgct 41640 
cccacctgtt ggtcgtcaga gcttctggtt ggtgggaagg cacccaggac cttgaggtct 41700 
ccagagagaa aagccaggga aagagggaga ccgaaaccca tgtgacatga aactcaggct 41760 
ccaaactgag cacgggaacg tttggggaca ggagcgcgat ggccttcctc agatagctgg 41 820 
ggggctggca tgaagacggg agctacagcc agcacaggtc ctgggccggg agcccagaga 41880 
ttgagccctg actctgtcac ttactggcca cgtgaccttg ggcgggtggc atagcctctt 41940 
ggagactcag tttcctcatt ggtaggagtg acggccacag tggtgcggcc tctgcagcac 42000 
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acggggggct cggtgggcgg aagccccggg tctataaggc ggctgtgcag gagccagccg 42060 
agctggtctc ccaacagcca gggctccggg gtccttagca gctgtggggg gcctgcacct 42120 
gtttcccatg gctgctgtca gaaattacca gaagccaggt ggctgagagt aatggacact 42180 
tgttctctca cagttcctga gggctgaagc ccgagatcga ggtgtgggca gggccctgcg 42240 
ccctctgaag gctctgaggg aacctttggg cttctggtgg ctccaggcac cccttgactt 42300 
gtggtcctgt cactccagtc tctctgtctg gctgcacatg gcgtggcctc ttctgtacca 42360 
ttgaaggaca cttcagttgg atttagggcc taccctcacc cattgtggtc gtatcttgat 42420 
ccttcatgac atttgtaaag accctgcttc caaataagct cacattctga ggttctgggg 42480 
tgagcgggaa tttggagagc attgttcaac tagtatagaa tgtgacctgt cagcctcggg 42540 
cagccctgag aggcaggggc tttccacagc ccagctgggt gccctgggct ccgtgctgtc 42600 
cgaggagacg ccatccccac acccgtcctt cacccgccac cctcccgcag gtacctgtac 42660 
ttcaccaaca tgcaggaccg ggcagccaag atcgaacgcg cagccctgga cggcaccgag 42720 
cgcgaggtcc tcttcaccac cggcctcatc cgccctgtgg ccctggtggt agacaacaca 42780 
ctgggcaagc tgttctgggt ggacgcggac ctgaagcgca ttgagagctg tgacctgtca 42840 
ggtacgcgcc ccggggcctg ccctaaccgc agacacccgg ccttcattgt cagtaatggc 42900 
agcagctgcc acattgtccg agacctgccg tgagcccagt gccgcgccag gggctttgtg 42960 
tgtagcgtgt tttgtcctca cactgacagc tgtaggctgg ggttctgagt gagccccaca 43020 
gggcagaggc agaaaatgag tctcagagag ggtgagcgag ctgcttgggg ccccacagca 43080 
ggagatggag caggactgca gcctagcctc tgcccccagc acctgcgcaa gaagctgctc 43140 
tgctctggac tgtgttaggc tgcgagggct ggagagaaat gagagttggt gcttagagag 43200 
ggggcgcagg tccccatggc ttttcctctt atgatgaggt agatgggtga agggaggggc 43260 
catgcttgca ggggccagtg accgaggccc gccgttggaa ctgatggcct tcatcccgag 43320 
cccagcccag gtgggagcag ggctttccga gggcttgtct tgggtcggcc tgcttccagg 43380 
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gactctgctg cagctcccac ccctgtccaa agcatggaat cccccaggct ccctggcagt 43440 
cctgtcaacc tctgtcctcc caagctgagt gtggggcaag ttctggaggt cagcactgct 43500 
caggggggcc cacgggctgc ttgcaggggc caaccgcctg accctggagg acgccaacat 43560 
cgtgcagcct ctgggcctga ccatccttgg caagcatctc tactggatcg accgccagca 43620 
gcagatgatc gagcgtgtgg agaagaccac cggggacaag cggactcgca tccagggccg 43680 
tgtcgcccac ctcactggca tccatgcagt ggaggaagtc agcctggagg agttctgtac 43740 
gtgggggctg gcagtggggt gggcagggtg gcctctaaac ccgacccctg gaggaggctg 43800 
gaggccagtg caagatcctg tgtggcctca gccaggcggt ggtctctgcc agatgccaac 43860 
tgttgcccgc tggggttcag cgacatgtcc gaatgtcccg aggcctctga ggttgttttc 43920 
ttttgccgca gaacaaatca ccacgaacag cgttttaaga caacaccaac tctttttttt 43980 
tttttttttt tgagtcagga tcttgctctg ttgcccaggc tggggtgccc tggtgcaaac 44040 
acagttcact gcagcctcga cctctgggct taattaagtg aacaccttgc ctcagcctcc 44100 
caggtagctg ggactacagg tgggcaccac cacacctggc taattttttt ttgtagagac 44160 
ggggtttccc catgttgccc aggctggtct gcaactcctg ggcacaagct atctgcctgc 44220 
tgtggcctcc caaagtgcta ggattatagg tgtgagccac tggcctgaca acacccacgg 44280 
attgtctctc agttctgtaa ggcaaagtcc aggcacagcg tggctcacct gggttctctg 44340 
ctcagggtct cacggggcca gaatcaaggt gtcaggaacg ctgggccctc agcggaggct 44400 
ctgtggagaa attagcttcc ttgctcactc agcaggtagc agttgtggga tcgaggttct 44460 
gttttctctc tggttattgg tcggggacca ctctcagctc ctagaggcca ccacaggtcc 44520 
ttgccccgtg gccctctctg cctcagcagt gggggctccc tgcgtcagtc cctcccacac 44580 
cttgagtctc tctgatttgc ttctaaaggg ccctgtgatt cggctcagcc acctttagat 44640 
taggttagcc tcccctttga tagactccaa gtcggctgat taataacctt aatcacatct 44700 
gcagaatccc ttctgccaca taaggtcatg acgccgtgct ggggactggg gtgggaaatt 44760 
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acggggtcat ttaggattct gcctgccact gccttgctgt gtcccagggc ttgggggagg 44820 
ggcctccaca gctgggacca cagtccttcc tcccctccat ggtaaccatc tgaggattac 44880 
ttgagaccag cctgggcaac atggtgagaa cccatcccta caaaaaatac aaacaaaaag 44940 
ggaccaggct gggcttggtg gctcatgcct ataatcccag cactttggga gaccaaggtg 45000 
ggctgatcac ttgaggttgg gagttcgaga ccagcctgcc caacatagtg aaatcccgtc 45060 
tctactaaaa atacaaaaat tagctgggtg tggtggcagg cgcctgtatt cccagctact 45120 
ggggaggctg aggtgggaga attacttgaa cctgggaggc ggaagttgca gtgagccaaa 45180 
attacgccac tgcactccag cctaggcaat agagtgagac tccgtctcaa aaaaaaaaaa 45240 
gggccagggg tggtagtgac aaagagaccc tatcccaaaa aaaccgaaca ctgaatcctt 45300 
gagactgagt aaggacactg tgaaattttt ctgggtgggg cagggaacag agcgtcttct 45360 
gtcatttctt ccacctgggt gtggtcagct ctccctccaa gctgcctcct cttcttctca 45420 
ttgtccgggt gttggacaca tttggttaac tggatagaat aacgcgagtt cccagggact 45480 
tggtccattt gctattttat tttattttat tttattttat tttatttatt tatttattta 45540 
tttatttatt tattgagatg gagtttcgtt tttgtcgccc aggctggagt gcagtggcgc 45600 
gatctcggtt cactgcaacc tctgcctccc aggttcaagt gattctccta cctcagcctt 45660 
ccaagtaact gggattacag gcacccacca ccataccagg ctaatttttt tgtattttta 45720 
gtagagacgg gttttcgcca ttttgcccag gctggtcttc aactcctagc ctcaggtgat 45780 
ccacgcacct cggcctccca aagtgctggg attacaggca tgagccacca cgcctggcac 45840 
catttgctat tttaattccc atgtgtatta gtgtcccacg gctgctgtaa caaatgacca 45900 
caaactggat ggcttaaagc aacagaaatg gattccccca atgtgctgga gaccagaagc 45960 
ctgcgaccaa actgttggga gggctgtgct tcctctgggg gctccaggga ggatctattt 46020 
gttggccctt ccagtgctgt gggtgccagc gttccacact tgtggatgcg ccgcctcaac 46080 
ctctgcccat cttcatgtgt ccatctcctt tgtgtctgcg tctttacctc ttcttcttgt 46140 
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ctgtgttgcc tcttataagg acgtttgtca ttgggtttag ggcccaccca aatcatccga 46200 
gatgacctcg tcttgagatc cttaacctgc aaagaccctt tttccaaaaa aaggttatgc 46260 
tcacagattc taggccttaa gacatgggtg tatctttctg gggggcacta tccaacccct 46320 
tatacaatga aagacgggaa gagggccagg tgtggtagtt cacgcctgta atctcagcac 46380 
tttaggaagc tgaagcggga ggatcacttg agcccaggag tttacaagta gctaggcaac 46440 
atgatgagac cccatttcta caaaaagtga aaaaaaaaaa aaaaaaaaaa aagccaggtg 46500 
tggtggctca cacctgtaat cccagcactt tgggaggctg aggcaggcag atcacgaggt 46560 
caggagattg agaccatcct ggctaacacg gtgaaacccc gtctctacta aaaatacaaa 46620 
aaattatggc cgggcgcagt ggctcccgcc tgtaatccca gcactttggg aggccgaggt 46680 
gggtgaatta caaggtcaag agatcgagac catcttggct aacacggtga aaccccatca 46740 
agatcacaag gtcaagagat ggagaccatc ctggctaaca cggtgaaacc ccgtctctac 46800 
taaaaataca aaaaattagc cgggcatggt agcgggcgcc tgtagtccca gctgctcggg 46860 
aggctgaggc aggagaatgg cgtgaacccg ggaggcggag cttgcggtga gccgagatcg 46920 
ctccatgcca ctgcactcca gcctgggtga cagagtgaga ctccgtctca aaaaaaaaaa 46980 
aaaaaaaaaa aaaaaaagaa aattagccag gcacagtggc aggtgcctat tgtcccagct 47040 
acttgggagg ctaaggcagg agaatggcat gaacccggga ggtggagttt gcagtgagcc 47 100 
gagatcatgc cactgcgctc cagcctgggc gatagagcaa gactctgtct caaaaaaaaa 47160 
agccaggcat ggtggtgcat gcctgtagtc ccagctactc aagaggctga ggcaggaggg 47220 
ttgttcgacc cacggagatc aaggctacag tgagccatga tcgcaccact gccctccagc 47280 
ctgggtgaca gagtgtgacc ctgtctcaaa gtaagtaaat aggaggagag acaagtgggc 47340 
agttcagact gatggtatgg gcacagtaga gactggtgca gacaggctgg cctgtgatgt 47400 
caagcaactt ctgtaattgt ttccggcatc catttgtgtg tcaatttccg tgtcagtagg 47460 
aagactctgt aggctgccaa gaggaataag tgggaggatc ctcccagaga ggccgggcct 47520 
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gcaggagggc cagttctcat gagttctcat ttggccccta ccctccaggc tgtggttctg 47580 
aggtgggaga cagagcctga cctctgtttg tcttgttttg tctttgcagc agcccaccca 47640 
tgtgcccgtg acaatggtgg ctgctcccac atctgtattg ccaagggtga tgggacacca 47700 
cggtgctcat gcccagtcca cctcgtgctc ctgcagaacc tgctgacctg tggaggtagg 47760 
tgtgacctag gtgctccttt ggggtgatgg acaggtacct gattctctgc ctgctaggct 47820 
gctgcctggc atccttttaa aatcacagtc cctgtggcat ccagtttcca aagctgattg 47880 
tgtcttcctt tgccctcctt tcttttctac tatgtgcatt cggtgctatg aattttcctc 47940 
taagtactgc gtttcctgca tctcacaaat tttgttacat tttcattttc aggtagtttg 48000 
aatattttta cacttctcct gagatgacat ctttggctca tgtgttattt agaagtgttg 48060 
cttagtttct aaagagttgg ggcttttcca gctgtctctc tgcaactgat ttctaattta 48120 
attctactgt agtctgagag cttattttat atgatttctg ttattttaaa tgtgttgggt 48180 
gtggtgtttt tgttgttatt gtttttgtgt ctttttgttt tgttttgctt cgtttgtttt 48240 
gtttttgaga cagtgtcttg ctctgtcact caggctggag tgcaatggcg cgatctcagc 48300 
tcaccgcaac ctctgcctcc cgggttcaag tgatcctctt gcctcagcct cctgagtagc 48360 
tgggattaca ggtgcacgcc accataccca gctaattttt gtatttttag tagagacggg 48420 
gtttcaccat gttggtcagg ctggtctcga actcctgacc tcgtgatccg cccacctcgg 48480 
cctcccaaag tgctgggatt ataggcgtga gccactgtgc ctggccatta ggtgtgtttt 48540 
atcacccagc atcatgcagt ttatcttggt gaatgttctg tgtactcttg aaaagaatgt 48600 
ggattctgct gttgttgggt ggagtgttcc agaaacatca attagatcca gttggttaat 48660 
agtgctcatc aggttgtctc tatccttcct tcctgactgc ctgcttgagc tgtcagttat 48720 
tgacaggggt gtggagtctc caactctaat ggtggatttg tttatttctc ctagtagttc 48780 
tatctttttc tctccttcta cccttgatcc tcttctcccc ctagggcttc ctggtgttag 48840 
tggtgggaga gtggggtagt gaagaacctg gactttaggg ccaaagaggc cagggttcaa 48900 
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atcctggctc tgtcacttcc cagttgagtg accctggctg gtgcctgaat ctctgtgagc 48960 
ctccacttcc tcctctgtga aattgagagc acttacctgg caggctgtca tgggcatcaa 49020 
gtaacagggc actccacctg gaccctgaca cgtgatgcac aggaatgcca gctgctatgc 49080 
catgggtgtg gcagtagtaa taaagtgacc atctgtatcc tcaccacagt gaagcctgtc 49 140 
cagggctttc tctcctatgc ccccatgcct ccaggtggcc ttggatcctg ttggttctgt 49200 
gctctgctca gcgacctttc tcccgtggga gttcctgggg gttcagcttc atcctacaga 49260 
cagcagcaca cactggctgt gcaccctttt tttttttttt tttttttttt tgagatggag 49320 
tctcgctttt ttcgcgcagg ctgaagtgca gtggtgtgat cttggctcac tgcaacctct 49380 
acctcctggg ttcaagtgat tttcctgcct caccctccca agtagctggg attacaggct 49440 
cccaccacca cgcccggcta atttttgtat tttcagtaga gatggtgttt caccatgttg 49500 
gccaggatgg tcttgaactc ctgacctcag gtgatccgcc cacctcagcc tcccaaagtg 49560 
cagggattac aggcgtgagc caccacaccc ggagtgccgg ttgtttttag cagtttgtct 49620 
tgttcctgga gagactggct cctgcccagg agctcgggga gtagggccgc ggggtgctgc 49680 
ctcacacctc gagtttggcc gtaagcagag gggacatttt gtgactgtcc ccctcctgag 49740. 
cttcccagca gcttttctcc aagttacagc ccaaaagctc aggtggattt gcaacccaac 49800 
ggtgtctgtg cacctcccac tgatgcccga actgccctgg ccaagaaacg gggccgtcag 49860 
aacgctgcac taactgcagc cttgggcctc catgccagag gccatgccct tccatccacc 49920 
accccctggc ctgggccctg ggccctcctg gctcgggaac tccaggcccc ttcctcacgg 49980 
ctcgagagac gtgtatttac cgcacaggtg cttgtcattc tcttgtggcc tcttctccag 50040 
ggagatcaca gaaggacagg gcctcactga ggtctcggac atggaccctt tgatagtggc 50100 
aggagccagg ctgggcaaga ggcggccaca gtcacctcag cagtgccatc accaccgcca 50160 
ttcagccctt ccctgagccg ggcgcgcccc tggctctggc cccagtgtcc cagttacagc 50220 
tcacaggagc ttgtggtgcc cagcggctgc ttctgattga gagtcgaggt cggaggcttt 50280 
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gggaggctga gaggctgctc ggtttcacaa ctgctgaggg agacttgggc tccatctcag 50340 
gtatgcccca tgtcgccctc aacctccagc caccggtcct ccgtgtcccc catggccagg 50400 
cacggcttgc agacatctgt cgttggctcc tctcagccgt cgtgggctga ccctggcacg 50460 
tcctcctgtg gctgagccca gtggggacag ctgcttcctt ttattaccct agaactctcg 50520 
tctttgatca ggccccctcc cctatgccac acagtccctg tcactcgggt gagcccagta 50580 
gtcatgggga aggcctgcgg gttccaaaca tccaaaggct tgcgtgcagc atgacagctt 50640 
gaaaccgatg ttttttacct tgatcagatt tcagcttggc gggggctttg ctcagctttc 50700 
agtgaggcct gggccgattt cccagcatcc cctcctgagg ccagcctctg tttcctgtga 50760 
ttttctgcac aaagtgggag ggaggagtcc taggaaatgg ggggccacct cgaagcctag 50820 
gcctcctctg gcttctctgt gccagtgccc ccacgctttg tgtctgtgtc cccagcccat 50880 
gggactctgc tattccctga gtgctgccgc atgcccagcc cgcactgagg acgtggagcc 50940 
ccgaggggca ggatggcctc catggtcaca cgtaggaagt ggcctccacc ctccgatgat 51000 
cctctccctc ctccctttca gcgccctccc cgggggtgtc ctcagccctc ctgcctgtgc 51060 
tttgtcccgt cttctgcagg cgcctgggac gtgctgacag gtcctctgcc ggctcctgcc 51120 
ttgctatgcg cacgctggtc accacagagg cctggccctt cttctgtagc agtcccacac 5 1 1 80 
ccgcaacagg tgtggctgct gaccacctgc tttctgcccc tctggtcctg aggagggcgc 5 1240 
agtgggcact caggcgtggc tgagcagatg tgtgttgccg ggaggaggaa ggactgctcc 5 1300 
agtcagggct gaatttccca cccggagcat ttctgctgta tttggtgtag cgcctgctgc 51360 
ttaaagctct gattcccagt tggcaccctt tcccttctgc attgaaaaac atacggatgc 5 1420 
atgtcttctt gcagtgaatg tgtattctcc cagcctctct tctgggttgg ggctggaggt 51480 
ggagcggcac acaggagccg cagcgatgga ggatgtgcgg gtgcagcacc ccgtacagca 5 1540 
gggatgccaa acccgcgctg agtccctctc aacttctgct ttgaagccca gtcacgccat 51600 
tgcctgggtt ttgctgggcg gggctgcgtg tgatgttctc ctctgtccct cccccagagc 5 1660 



WO 01/92891 PCT/US01/16946 

215 

cgcccacctg ctccccggac cagtttgcat gtgccacagg ggagatcgac tgtatccccg 5 1720 
gggcctggcg ctgtgacggc tttcccgagt gcgatgacca gagcgacgag gagggctgcc 51780 
ccgtgtgctc cgccgcccag ttcccctgcg cgcggggtca gtgtgtggac ctgcgcctgc 5 1 840 
gctgcgacgg cgaggcagac tgtcaggacc gctcagacga ggcggactgt gacggtgagg 51900 
ccctccccgt caaggctctg. ccaagaccct ggccctgccc tccgggatac gagcttgggg 5 1960 
ctgcctccgg cctcacagga gtaggggctc tgaaaacctt tgcttgcagg gagattgcca 52020 
agtctgtctt ttaggcccaa caaggaaaac tctgcagttc cacccatcct gtcccaccag 52080 
gtagtgtggc ttgaaggcag actgtgaggg tctatctcac cttcctgcat taggtcagga 52140 
gtttcacaga aacctgaggc acattcaggg gtgggctgca gaggtccatg gctcacaccc 52200 
tggaaaatcc gcccccaaaa gacagtgctg tctccactga ccagtctgtg ggatagtgct 52260 
taagcctgag tggtttctat caacatgtag aatcaggagg tataaagaga tttgctcagg 52320 
catcctgggc cctctctgac cagcaggatc ttcctttaga tcttgacagt gaaacacatc 52380 
tcttctgtgc cccctgtgag ttttctttca ttcattcatt cattcattca ttcattcatt 52440 
cattcattcg agacagagtc ttgctctgtc acccaggctg gagtgccctg gtgtaatctc 52500 
ggctcactgc aacctctgcc tccagggttc aatcgattct cctgcctcag cctcccgagt 52560 
agctgggatg acaggtgcgc accaccatgc ctggctaatt tttgtatttt tagtagagac 52620 
agggtttcac catgttggcc aggctggtct cgaactcctg acctcaggtg atccgcccgc 52680 
ctcagcctcc caaagtgctg ggattacagg catgagccac cgcgcccggc ctgagttttc 52740 
cttttatgaa ggacctgctt ggttggttgc ctgccacatg ttgtcagcac catgggccca 52800 
ggactgctga ggagctgttg atgccctcgc tctcccagag ccaccggctc tgttagataa 52860 
ttcacatgca gtctggccac tgtcctacgt cctcattcac aaagagcaga catttcgtag 52920 
aagatgaggg cctgggagta acctccctgc atgtttttct ataaaggcat agtggttaag 52980 
tccttccagc tcattgacca ttggagaatt ttatggaggc tgtagactag gggctggtaa 53040 
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actaagggcc caggggccaa atccagcctg ccacctactt ttgtaaataa agttttcttg 53100 
gtgcacagcc atgcccattc attcatttgc acaatgtctg tggctgcttt catgccaaaa 53160 
gcaagagaac tgagtggtta tgctggagac ctacggcctt caaagcccca gacctcacgt 53220 
ctggcccttg acagacagag cttccccagc cctgctgcgc atcctggccc agcatgtgct 53280 
gtgtgtgtga tttcagcttg caggagccgt ggttaggaat tgtccctgtg ttggtccatt 53340 
ttgcattgct atgaaggagc acctgaggcc gggtagatta tgaaggaaag aggtctgtct 53400 
ggctcatggt tctgtaggca gcaccagtat ggcacccgca tctgctcagc ttctagtgag 53460 
gtctcaggaa gctttgactc atggtgaaag tcgaagcggg agcaggtgca tcacatggtg 53520 
a gagagggag caacggagag agagagagag cgcctctccc tcttgccctc accttgagag 53580 
gagatgccag gctcctttaa gtaaccagct cccatgtgaa ctcacagtga gagcccattt 53640 
gctactgcgg agagggcacc aggcatctgc tcccatgacc caaacactgc ccaccaggcc 53700 
ctacctccaa ccttggggtc atattttatt ctgttctatg ctatgctatg ctatgccatg 53760 
ccatgccatg ccatgctatt cctattctat tatttgagac agaatctcgc tctgttgccc 53820 
aggctggagt gcagtggcat gatcttggct cactgcaacc tccacctccc aggttcaagc 53880 
gattctcctg tatcagcctc ccgagtagct gggattacag gcacacacca ccacacccgg 53940 
ctaatttttg tattttcaat agagatgggg tttcaccatg ttggccaggc tggtctcaaa 54000 
ctcctggcct caagtgatcc acctacctcg gcctcccaaa gtgccatgat tacagatgtg 54060 
agtcactgcg cccagtgagg gtcacatttc cgttgagatt tggaggggca gacgttggag 54120 
ccatctgagc cccctcgtcc cgctctagct tctcctcccg tgtgccccgc ggtgctggtg 54180 
gcaggccctt acgccggttc tggctgcatg ctctgttcca gaagctttct tccctgcttg 54240 
gttaccagaa aatcatccca tccattacaa ggacagggtc cccttatctc ccattcccag 54300 
ggcaggacac cgggggcagg gcaggtgggg aactgagcaa gttctctggg ggcaggcgtg 54360 
gctatggctc cctctgggtg ggcgtctggg gaggggtgga ggcagccgtc agcgccctgg 54420 
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cttgctcttc ctccctggcc agagactgtg gccttgtgct gctcccgtgt gggctgcctg 54480 
cacctccagt gggttgtgct ccctcccctc ccctcccctc aagctctgct gagcaccact 54540 
gccttccaca gcccccactc tcgggaggcg aggctcctcg tggccattcc tgtccttggc 54600 
acccaccccc ccaccaacct ggtagagcct tgggcggggt ctgttactcc ttgcatggcg 54660 
tagacctccc cacagtaggc acctgacaca tacctcctgg ggggcaggca ggaggtgcgt 54720 
tgaggtctca gccctggcag tccctcccct gcgtggcata ggcctcgcca cagggtcatc 54780 
gagggtgggt ggagactgta ctagaccact ccccgctggt cctagaaagg gtcccatctg 54840 
tctgctctct gtttggagtc cagaccttgg ttgctgtgcc ctgcatggtg ggctgggggg 54900 
caccctccag cctctctgag tgcatggcct ctccttgcag ccatctgcct gcccaaccag 54960 
ttccggtgtg cgagcggcca gtgtgtcctc atcaaacagc agtgcgactc cttccccgac 55020 
tgtatcgacg gctccgacga gctcatgtgt ggtgagccag cttctggcac ggggaagggg 55080 
cgtccgggct gggttccccc aggaacgtgg agtttagggg aggagacgtg cctttccagc 55140 
ggggctgggg gctgtgtggg agactcaggc ggctgggagg ctccttgcgg gaggcaggga 55200 
agcctttccc agggcagcgg ccaggaggac agactgtgag ctgtgggctc ggcggctaca 55260 
gagtctgcct cagtgggcgg ggctgatggt gtccaggtgc ctgcagcacg cacccaccca 55320 
cgggaccttg ctgagcagcg tctgtcaggc agcaagatta cccgagggct gcagtggtcc 55380 
tgttccctgg cagcttactg tctggctgag gaggagtgat gttcacatat gcacacatgt 55440 
catgtgcaca cacatgtaca tgacaacatc ccacatgctc ctcaaatagc atgacctgta 55500 
cagtcacgga tatagggcct aggggatagg aggccaagac agtcagggaa gactttccag 55560 
aggcagtggc tcctgaaagg ctgtctgatt caggcaggaa gggagctgag ttcagatagg 55620 
aagtagcaat gagtcattgt gtctggggac atggccactc cttcgctgca gagggacctg 55680 
ggctgagagc tcctctctta tggctgcagt cgggagagaa gtctgttggg gggagaaggg 55740 
ggcttcctca agggactccc tgtgcccttt ggcaccttcg tgccaggtca ggcttgaggc 55800 
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ctgaaggcag tggtgggggc caccaagggt cgcctcctct gctgggcaag ttcccagtct 55860 
gacgggcctg tgccgtgggc cccagctgtg ggggcgctgt tgatgcg'cag ccaggcctcg 55920 
ccgccagagc ccgcacgctt ccattccgct gacttcatcg acgccctcag gatcgctggg 55980 
ccggccctgt gggagagtga atgtggcttt tgccaaagtt gagtctggag cctggaaact 56040 
tccctatggg cagccttgat agtggagtgg cccaaggagc ccacccagcc gaccctgccc 56100 
ctcccgtggc tggtgggcgg caccaggggc tgcctggctt tgctcgttca ccaacatcac 56160 
ctgggctggc cagggcgcgc tcacttctgc caccaccgag ggccctgggc gaaggagtga 56220 
ataccaggct gccttggcag ggatgtgttg agggctgtgg ggagtcggac agcggcgggg 56280 
gtcagaggag gaggagggtg caccgtgcag gctgaagggc cacgttaccc tgaggttggc 56340 
caggctcccc aggcctagcc tcccagctcc cccactttct ccccaccctc caccagtggc 56400 
aaagccagcc ccttcagggc gcacggtgtc tgcccccaag gagggcccat tccgttgggg 56460 
ttaatgttgg ccacctcttt ctgtttgtct ctggcagaaa tcaccaagcc gccctcagac 56520 
gacagcccgg cccacagcag tgccatcggg cccgtcattg gcatcatcct ctctctcttc 56580 
gtcatgggtg gtgtctattt tgtgtgccag cgcgtggtgt gccagcgcta tgcgggggcc 56640 
aacgggccct tcccgcacga gtatgtcagc gggaccccgc acgtgcccct caatttcata 56700 
gccccgggcg gttcccagca tggccccttc acaggtaagg agcctgagat atggaatgat 56760 
ctggaggagg caggagagta gtctgggcag ctttggggag tggagcaggg atgtgctacc 56820 
ccaggccctc ttgcacatgt ggcagacatt gctaatcgat cacagcattc agcctttccc 56880 
actgagcctg tgcttggcat cagaatcctt caacacagag gcctgcatgg ctgtagcaac 56940 
ccaccctttg gcactgtagg tgtggagaaa gctccttgga cttgaccttc atattctagt 57000 
aggacatgtg ctgtgttgtc cacaaatcct catgtaccct agaaatgaat gtgggggcgg 57060 
ctgggctctc tccagagctg aaggaatcac tctgtaccat acagcagctt tgtcttgagt 57120 
gcagctggga tttgtggctg agcagttaca attcctacgt ggcccaggca ccaggaacgc 57180 
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aggctgtgtt tgtagatggc tgggcagccg caccgcagag ctgcaccatg ctggtttgta 57240 
tcacatgggt gaccatggta tgtctaagaa ggtggagtcc ctgtgaggtc tgcaggtgcc 57300 
cccacagctc caggccacct tgaggattgc ctctgcctgc ccagccctga gttccctctc 57360 
ccctgtcctg tcccactgtc accccaagcc ggcctcattg ggagcctgtt ggatggcagg 57420 
gtatagatgt aacctgattc tctctgggga gcggggttat ctggcttctc aagagctcct 57480 
aggagcccac agtggtggca ccatcacagt cgcagcagcc cccagagaac gcggccctgt 57540 
ctgttcctgg cgtgctctgt gctgccccgc ctgggttccc tgccccagtc gcaggcccct 57600 
tggaggaggt accatgtgtc tcccgtttca cagatgagcc ccggggagct cactctagta 57660 
gtggccagag aggcctgcgg ctcagggagc ggggcacatt tccaacagga cacaccgccc 57720 
tggtctgagt ctcgtgggta gtgggagcag aggagagcgc cctatgtctg tggggcggct 57780 
tggctgagcc tggaagccac ctgacctccc ccgtcccttc cctgccaggc atcgcatgcg 57840 
gaaagtccat gatgagctcc gtgagcctga tggggggccg gggcggggtg cccctctacg 57900 
accggaacca cgtcacaggg gcctcgtcca gcagctcgtc cagcacgaag gccacgctgt 57960 
acccgccggt gaggggcggg gccggggagg ggcggggcgg gatggggctg tgggcccctc 58020 
ccaccgtcag tgctggccac cggaggcttc ccgggttcct gggggctgtg ccaccgcctc 58080 
tgaggcatgc ttgctttctt cccttttcaa acccttctgc ttccttcttt aatgacattg 58140 
ttgattgtgg ataatctgaa aactacacaa aaatataaag agccaaaatc tcacccaaat 58200 
ccacctccta gagtggctgt tgggctccgt cagcatccag gcggccgtct gtgttccgca 58260 
cggcccagcc catcgatagc cgcctgcacc aggcctgtct gccctctgtg agcctcccca 5S320 
cagggttccc tccacaaaca ccctgttctc ccacccaggg ctggctgctt cctggaaaac 58380 
agctggatgg ttttgtgcat gacagacaaa cacagggtga ttttcgtggc taaaatactc 58440 
cctggagctt ttggcagggt gaggggctgg ctccagctga gccacgcctt gagtgaaatg 58500 
actgtgagga gaataaactg ccgctgccct ccaggatcac tggggctggc tggggagaac 58560 
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ccccgtttct gggagcacag tcccaggatg ccaaggcgag cttggtgccg agatgtgaac 58620 
tcctgagtgt aaacagcggg ggctgacttg acatgctttg tatgcttttc atttgttcct 58680 
gcagctgtat gcccctaagg tgagtccagc ccccttctgc ttcctctggg gcctcgccag 58740 
tgagccccac cttgctgggg ctggttcctc ctgcccttct gggtatccct cacatctggg 58800 
gtcttgtctt cttgttttct ttttcttttt tttttgagac ggagtttcac ttttgttgcc 58860 
caggcttcag tgcaatggtg tgatctctag gctcaccgca acctctgcct cccaggttca 58920 
agcagttccc ctgcctcagc ctccctagta gctgggatta caggcatgtg ccaccacgcc 58980 
cagctaattt tgtattttta gtagagatgg ggtttctcca tgttggtcag gctgatcttg 59040 
aactccctac ctcaggtgat ccgcccacct tggcctccca aagtgctggg attacaggcg 59100 
tgagccaccg cacctggcct ttttcttttc ttttcttttc ttttttctga gacagggtct 59160 
cgctctgtca cccaggctgg agtgcaatgg tgtcatcatg gctaactgca gcctctacct 59220 
tctaggctca agcaatcctc ccatctcagc ccctaagtag ctaggactgc acgcatgcat 59280 
ccccatgccc agctaatatt tacatttttt gtagagatga agtttcacta tattgcccag 59340 
gctggtctcc aactcctgga ctcgagcgat cctcctgcct cggcctcccc aggtgctggg 59400 
attacaggcg tgagccaccg tgcctggcct ggggtattgt cttcttatgg cacctgactg 59460 
tggtgggccc tgggaaggaa gtagcagaag agggttcttc ttggtttcct ggacagtaac 59520 
tgagtgttct ggaggcccca gggcctggct ttgtttaggg acaaagggaa ctggtaacca 59580 
gaagccgaga gtttaaacac ccactgccct tcttccctgc tcctgctgct gcaacccagc 59640 
ttaaccagcc aggagtgcta ggaacccaag cagggccccc gagcacacag caggcagctc 59700 
acgaattctc ttttcctgtt ctcccttggg agctgggagg atcttaatca ggcaataaga 59760 
gatggcactg agcagccagc taatttttta aatcacttta ttgtttaacc atatgactca 59820 
cccacttaaa aaagggtaca gttcagtggg ttttagtgta ttcacagatg tgtgcaaccc 59880" 
tcaccacagt taattttaga acattttcct gcccctaaaa gaaactctgc atgaagccag 59940 
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ctgtttttaa attagcaaag ttattttgca tcctttaaat atatgttcat ggtacaaaat 60000 
tcaaaagata cagaagagtc tgcagtccaa agagactccg cccccatgac gccaagcagg 60060 
catccctggg aggcatggcc tcctgcagtg tgtttcttct atgtcccccc aggggtcatc 60120 
tgtacatatg caagcataca agagcgtgga ctttgttttc caagccagaa gataattgta 60180 
gatttatgtg cagttgtgag aaagagcaca gacccattta tcctctgcct ggtttccccc 60240 
agtgctgcct gccatcttgc atgacttcca ttcctatcat aagcaagaca ctgataacga 60300 
ttctttcacc ttattcagat tgacataagt gttttttgtt tgttcttgag acaaacttcc 60360 
tctgtcaccc agtgggagtg cagtggcaca atcacagctc actgcagcct caaactcctg 60420 
ggctcaagcg attctcctgc ctcagtcccc tcaagtagct cagatggcag gtgtgcacca 60480 
tcatgccagg ctaattttta aattttttgt ggaggtgagg cctcactaaa tttcctgggc 60540 
tagtcttgaa ctcctgagct aaagtgatcc tcctgcctca gcctcccaaa gtggtaggat 60600 
tacaggcatg agccactgcg cctgggctga catatgtgtt ttcgtaagcc cgaaagatag 60660 
catctgaaga gtcaacattg agccttgcct tttgctgcta acgatgtata aaagctgctg 60720 
ttctgagcat ttcggaggct cccagctgcc gtgtgcaccc tgcctagagc tctaccgtaa 60780 
cccatctccg ggaggaggtg ctattgtttt cctcattttg caacaaggag gctgaagaac 60840 
tgagcatgaa ccactggcct gggtcgttcg gttggtaggc agtggggcca ggccatccaa 60900 
ctcacaacca ccttctactc tgcttccccc gcaccctgaa gtttgttctg ttttgaggac 60960 
acagccgtca cattcttggt ggctgaacag cactccttgt caggcgtggc tgggccccca 61020 
ctggagggca tcatggtcct ctctcctgct gcggttgaac cttggctgtt tcaaccactc 61080 
ctgccaagtg gccctctgaa agggacagtc catcttttct cagcagaggg ccacactggc 61 140 
aaaacggtcc ctggcaccct ttctctccac ctgtctaata tagagtaaaa atggtatcat 61200 
gttaagatct tcatttatat ttattttatc atgaatgatg taagcatcat tttgtgtgtt 61260 
taagaacctt tgggcccagc gtgatggctt gcagctgtaa tctcagcact ttaggaggct 6 1320 
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gagatgagcg gatcacttga ggccgggagt ttgagaccag cctggccaac atggagaaac 61380 
cccgtctcta gtaaaaattt aaaaattagc cgggtatggt gatcccagct acttgggagt 61440 
ctgaagcatg agaattgctt gaacatggga ggcggaggtt gcagtgagcc gagatcgcgc 61500 
cattgcactc cagcctgggc gacagagcga gactctgtct caaaaaaaaa aaaaaaaaag 6 1560 
aaaagaaaag aaattatcaa tctcctcttt tatggcatat atatatatat atatatatat 61620 
atatatatat atatatattt tttttttttg gttatgttca gaaaggcctt ccctgctctg 61680 
atcataaaaa acaacttatt ttcacactct ctctcttttt tttttgagac agagttttgc 61740 
tcctgttgcc caggctggag tgcagtggcg caatctcagc tcactgtaac ctccgcctcc 61800 
cgggttggag tgattctcct gccttacctt cccgagtagc tgggattata ggcatgcacc 61860 
accatgcctg gctaattttg tacttttagt agagacgggg gtttctccat gttggtcagg 61920 
ctggtctcga actcgcgacc tcaggtgatc cacccacctc ggcctcccaa agtgctggga 61980 
ttacagacgt gagccaccat gcccagccca cactctcttt cttaacgtcc tcctcctttc 62040 
gttttacgtt cacatcttta attcttctgg gatgtaatta gatttgatga gcaaggtggg 62100 
catccagctt gtttcttggc tgatggctta tgggtggcgt gaattagtcg gggtctatca 62160 
ggaggcagaa actctatgag aatttgaaca gagaaagttc cgtctacagg cttattacca 62220 
gggactggaa tagcagaaat tgaacagtga gatgtacaga gaactctaag aatgcaggaa 62280 
taggccaggc atggtggctc acacctgtca tcccagcact ttgggagacc aaggcgggtg 62340 
gatcacctga ggtcaggagt tcgagaccag cctggccaac atagtgaaac cccatctcta 62400 
ctaaaaatac aaaaaaatta gctgggtgtg gtggcgcatg cctgtaatcc cagctactcg 62460 
ggaggctgag gcaggagaat cacttgaacc tgggaggcag aggttgcagt gagccgagat 62520 
catgccactg tactccagcc tgggtggaag agcggaactc tgtctgaaaa aaaaaaaaaa 62580 
aacaagaagt tcaacttgaa gggaaaaatg ccgtattgtc tttccctttg ttatgtcacc 62640 
agggcacagt ccatcccagg ctggcgctga tccacgggct ggagaggggc tgccccagaa . 62700 
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gaggacatgc caggaagggc ttggctggtg ttcaggagcc caggccaggt caggtcaaga 62760 
ggtgttgagg ctggacggga gaggccagct aggggctcat gtaggatatg aggggtcggc 62820 
ccatttcaac gtggaaactg agctcttctg cttctctttc ttcttcactg cattaagatt 62880 
caataccgct tgggaagcag gtatttccct tcctataaag gatggttggg agcctgagtg 62940 
ttgggagaaa gtgtagccgc tgagttacta acaactaggg ctgccgtcaa gcctatgggg 63000 
aaagagagaa gaggacattt ggaaggagag agatcaagct gtggcaccct gggagaggac 63060 
cacagaaaag aggccagtga gggggttccc cggtggcatc tgaaggtgtg gcccaaccag 63 120 
gaggtccaga ggctgccagc cgagtggccc aggagaggga acctcacagg ggctgagtgg 63 180 
gacccaagcc ctatccaccg tcctaaccac ccacatttct cgggaacaag acctcccaca 63240 
gtggcctccc cggcagtgga aatagccaaa ctggcaacat ggactttctt caactgcccg 63300 
ggcgatgctg cctcagtgcc ccagggcagg caggaagctc ccacacccat tctggaatga 63360 
ggggttggag gaaggctgag ctgagcaaag gacccatctc tgctctggtt ggtggggagg 63420 
gagcccatta tacaagagac ccctcagggc tcagtgaggg gtgacagaga cttggggagt 63480 
agtggctgtc actgcagagg tgagagggtt tggagagaag gtacatgcct ttttggccac 63540 
attgagtagc acctggtagc cagttagtaa cgtgtattgg ataaacaaaa gattaaacgg 63600 
atgcaaaaaa aaatgttggc tttgcttctt tttacccaaa cctcagttcc ctcaagtaga 63660 
ttctgggaac accccctacc tggctggact gttgtgaagt ttaaataagc caggttaact 63720 
tcacctcctc ctttaagaca cagctcagac actgcctcct ccaagaagcc ccctctggct 63780 
tcctgtgtga atatgacggc cctctgggct ctagggtatc ttagaacaat gcttccttat 63840 
ggctttggaa ccccgctgtc tcctggattg ggagcaaatg caggggagga gccacacctg 63900 
actaatctct gggtctccca gcacataagt ggcataaggg cagggctgtg cccgcttcag 63960 
gcacttactg aaggatgtac ttggcagagg gtaggcagcc ggcggatgag cccctcactc 64020 
tccccagctg actgcgtggg cgggaaaggc gggttcagga gacccagcct ccctgggctg 64080 
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tcacqacctc tgcacatcca gccccattga tcaagggttc aatttttggg gtcctgttgg 64140 
gaggccagga gactctctcc aggcacttct tccaggtctt tgtgttaggg tgtgtgtgtg 64200 
tgtgtgtgtg tgtgtgtgtg tgtgttgttt gttttatttt atttatttat ttatttattt 64260 
atttatttat ttatttattt tgagacgcag tctcgctctg ttgcccaggt tggagggtgg 64320 
tggcatgatc tcggctcact gcaagctccg cctcccgggt tcacgccatt ctcctgcctc 64380 
actcttcctg agtagccgga ttacaggcgc acgcaccatg cctggctaat tattttgttt 64440 
ttttagtaga gacagggttt cgccacgttg cccaggctgg tcttgaatcc ctggcctcaa 64500 
gcgatccgcc cgcctcagcc tcccaaagtg ctgggattac aggcgtgagc caccgtgccc 64560 
gcccagccta ggggtacatg aaactttttt tttttttttt ttgagacaga gtttcactct 64620 
gtcctcaggc tggagtgcag tggcgtgatc tcggcgtact gcaatctccg cctcccggtt 64680 
caagcgattc tcctgcctca gcctcccgag tagctgggat tgcaggcacg cgccaccaca 64740 
cccagctaat ttttgtattt ttagtagaga cgggctttca ccatgtggga caggatggtc 64800 
tcgatctcct gacctcgtga tccgcccgcc tcagcctccg aaagtgctgg gattacaggc 64860 
ctgagccacc gtgcccagcc atgatgtttt gatacaggca tataacgtat aataatcaca 64920 
tcagggtaaa tgatgtaacc atcacatcaa gcatttatcc tttgtgttac aaaaaaaaat 64980 
ctaattatac tttcctactt attctttttt tttttttttt ttgagacgga gtctccctca 65040 
gtcgcccagg ctggagtgca gtggcatgat ctcagttcac tgcaagctct gcctcctagc 65100 
tctgcctcct gggttcatgc cattctcctg tctcagcctc gcgagtagct gggactacag 65160 
gcgcctgcca ccgtgcccgg ctaatttttt tttttgtatt tttggtagag acagggtttc 65220 
accgtgttag ccaggatggt ctcgatctcc tgacctcata atccgcccgt ctcggcctcc 65280 
caaagtgctg ggattacagg catgagccac cgcccccagc ctatttattc ttaaatgtac 65340 
aataaattat tgttgactcc agtcaccctg ctgtgctacc aaatacggat cttcttcatt 65400 
ctatctaact gtatttctgt acctgttaac catctctcct ccacctcacc ccccaaaccc 65460 
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actacccttc tcagcctctg gtaaccatcc ttctactctc tatctctatg agttcaattg 65520 
tattaatttt tagctccccg gccgggcacg gtggctcacg cctgtaatcc cagcacttca 65580 
ggaggctgag gcaggtggat cacgaggtca ggagtttgag accagcctgg ccaacatggt 65640 
ggaaccccat ctctactaaa aacacaaaaa ttagctgggc gtggtggtgg gcgcttgtag 65700 
tcccagctac ttgggaggct gaggcaggag aatcgcttga aactgggagg cagaggttgc 65760 
agtgagccaa gattgcgcca ctgcactcca gtctgggtga cagagtaaga ttccatcccg 65820 
aaaaaaaaaa agtttagctc ccacaaataa gtgagaacac gtgaagtttc tctttctgtg 65880 
cctcgcttgt ttcacttaac ataatgacct ccagttccat ccacgttgtt gctttgttat 65940 
aaatgacagg atcttggtca ggcgcagtgg ctcatgcctg taatcccagc actttgggag 66000 
gctgaggtgg actgatcatg aggtcaagag atcgagacca tcctggctaa cacagtgaaa 66060 
ccccgtctct actaaaaata caagaaatta gccgggcgtg gtggtgggca cccatttccg 66120 
ccccttctcg ggacgctgat gcacgacata ttacccatcc ccggaagact aatcctcccc 66180 
cactctatat tgtacctctt cctttctcct ccacgcgatt ccccgagtaa cccgtcttcc 66240 
ctccctcctc ggattacgct cacctttccg cttcaatcac gttgctccgt ccccttcccc 66300 
attcgtacca ctcctcactt tcgtcttcct acccccacta tcccttttcg tcctctctat 66360 
tccttactta ctcctccccc ttctcttcat acttcattcc ctccgctctt cccactcgcg 66420 
ctcccacttt cacctagttg ccctcaccta cgttgccatc tcgccccttc ttcagctctc 66480 
ggcctctcac ccatctgtcc tctctcttac ctctctcctc atctcgctca gacatctctc 66540 
tagactatcc ctcactttac cttctcagtc gtcttcttcc tatccttcgt tctccatgat 66600 
cttcacgtcg ccatctcttt tcgccccttt catatgtctc tcttcatgtt ctcactatca 66660 
ttctcatgat cactatcgtt ctcactactt atcactcccc tctttcttca tcaattcctc 66720 
tccgtcattc tcgtctctct cttacaaccg ccttccttgt gctatctaac tcaaccatgc 66780 
ctctcctact ctctctctat cgcccctcca tcgcttatgc atcctcttct attgcacacc 66840 
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cgcccctcca tcgcttatgc atcctcttct attgcacacc gcccctccat cgcttatgca 66900 
tcctcttcta ttgcacatcc tcttctattg cac 66933 

<210> 12 
<211> 21 
<212> DNA 

< 213 > Artificial Sequence 
<220> 

<223 > Artificial sequence is a primer. 
<400> 12 

ctgagcggaa ttcgtgagac c 21 

<210> 13 

<211> 23 

<212> DNA 

<213> Artificial Sequence 

<220> 

< 223 > Artificial sequence is a primer. 
<400> 13 

ttggtctcac gtattccgct cga 23 

<210> 14 
<211> 20 
<212> DNA 

< 213 > Artificial Sequence 
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<220> 

<223 > Artificial sequence is a primer. 
<400> 14 

ctcgagaatt ctggatcctc 20 

<210> 15 

<211> 22 

<212> DNA 

<213> Artificial Sequence 

<220> 

< 223 > Artificial sequence is a primer. 
<400> 15 

ttgaggatcc agaattctcg ag 22 

<210> 16 

<211> 21 

<212> DNA 

<213 > Artificial Sequence 

<220> 

< 223 > Artificial sequence is a primer. 
<400> 16 

tgtatgcgaa ttcgctgcgc g 21 

<210> 17 
<211> 23 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223 > Artificial sequence is a primer. 
<400> 17 

ttcgcgcagc gaattcgcat aca 23 

<210> 18 

<211> 21 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 
<400> 18 

gtccactgaa ttctcagtga g 21 

<210> 19 

<211> 23 

<212> DNA 

<213 > Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 
<400> 19 

ttgtcactga gaattcagtg gac 23 

<210> 20 
<211> 21 
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<212> DNA 

<213> Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 

<400> 20 
gaatccgaat tcctggtcag c 

<210> 21 

<211>23 

<212> DNA 

<213 > Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 



21 



<400 > 21 

ttgctgacca ggaattcgga ttc 23 



<210> 22 

<211> 33 

<212> DNA 

<213 > Artificial Sequence 

<220> 

< 223 > Artificial sequence is a primer. 
<400 > 22 

cuacuacuac uactgagcgg aattcgtgag acc 33 
<210> 23 
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<211> 32 
<212> DNA 

< 213 > Artificial Sequence 
<220> 

< 223 > Artificial sequence is a primer. 
<400 > 23 

cuacuacuac uactcgagaa ttctggatcc tc 32 

<210> 24 
<211>33 
<212> DNA 

< 213 > Artificial Sequence 
<220> 

< 223 > Artificial sequence is a primer. 
<400> 24 

cuacuacuac uatgtatgcg aattcgctgc gcg 33 

<210> 25 
<211> 33 
<212> DNA 

< 213 > Artificial Sequence 
<220> 

< 223 > Artificial sequence is a primer. 
<400> 25 

cuacuacuac uagtccactg aattctcagt gag 33 
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<210> 26 

<211> 33 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 
<400 > 26 

cuacuacuac uagaatccga attcctggtc age 

<210> 27 

<211> 45 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 
<400> 27 

aactggaaga attcgcggcc gcaggaattt tttttttttt ttttt 

<210> 28 

<211> 13 

<212> DNA 

<213> Artificial Sequence 

<220> 

< 223 > Artificial sequence is a primer. 
<400> 28 
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aattcggcac gag 13 

<210> 29 
<211> 9 
<212> DNA 

< 213 > Artificial Sequence 
<220> 

< 223 > Artificial sequence is a primer. 
<400 > 29 

ctcgtgccg 9 

<210> 30 

<211> 14 

<212> DNA 

<213> Artificial Sequence 

<220> 

< 223 > Artificial sequence is a primer. 
<400 > 30 

gtacgacggc cagt 14 

<210> 31 
<211> 16 
<212> DNA 

< 213 > Artificial Sequence 
<220> 
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< 223 > Artificial sequence is a primer. 

<400> 31 
aacagctatg accatg 

<210> 32 
<211> 18 
<212> DNA 

< 213 > Artificial Sequence 
<220> 

<223 > Artificial sequence is a primer. 

<400> 32 
ccaagttctg agaagtcc 

<210> 33 

<211> 20 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 

<400> 33 
aatacctgaa accatacctg 

<210> 34 

<211> 57 

<212> DNA 

< 213 > Artificial Sequence 
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<220> 

< 223 > Artificial sequence is a primer. 
<400> 34 

agctgctcgt agctgtctct ccctggatca cgggtacatg tactggacag actgggt 57 

<210> 35 

<211> 56 

<212> DNA 

<213> Artificial Sequence 

<220> 

< 223 > Artificial sequence is a primer. 
<400 > 35 

tgagacgccc ggattgagcg ggcagggata gcttattccc tgtgccgcat tacggc 56 

<210> 36 
<211> 27 
<212> DNA 

< 213 > Artificial Sequence 
<220> 

< 223 > Artificial sequence is a primer. 
<400> 36 

agctgctcgt agctgtctct ccctgga 27 

<210> 37 
<211> 27 
<212> DNA 
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<213 > Artificial Sequence 
<220> 

<223 > Artificial sequence is a primer. 
<400 > 37 

gccgtaatgc ggcacaggga ataagct 27 

<210> 38 

<211> 20 

<212> DNA 

<213> Artificial Sequence' 

<220> 

<223 > Artificial sequence is a primer. 
<400> 38 

gagaggctat atccctgggc 2.0 

<210> 39 

<211> 20 

<212> DNA 

<213 > Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 
<400> 39 

acagcacgtg tttaaagggg 20 

<210> 40 
<211> 163 
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<212> DNA 

< 213 > Homo sapiens 

<400> 40 



actaaagcgc cgccgccgcg ccatggagcc cgagtgagct cggcgcgggc ccgtccggcc 60 
gccggacaac atggaggcag ctccgcccgg gccgccgtgg ccgctgctgc tgctgctgct 120 
gctgctgctg gcgctgtgcg gctgcccggc ccccgccgcg gcc 163 



<210> 41 
<211> 419 
<212> DNA 
<213> Homo sapiens 
<400> 41 



gccccacagc ctcgccgctc ctgctatttg ccaaccgccg ggacgtacgg ctggtggacg 60 

SS^ aSt J^ gCtSSag tCCaccatc S Wcagcgg cctggaggat gcggccgcag 120 

tggacttcca gttttccaag ggagccgtgt actggacaga cgtgagcgag gaggccatca 180 
agcagaccta cctgaaccag acgggggccg ccgtgcagaa cgtggtcatc tccggcctgg 240 

tctctcccga cggcctcgcc tgcgactggg tgggcaagaa gctgtactgg acggactcag 300 

agaccaaccg catcgaggtg gccaacctca atggcacatc ccggaaggtg ctcttctggc 360 

aggaccttga ccagccgagg gccatcgcct tggaccccgc tcacgggtaa accctgctg 419 



<210> 42 
<211> 221 
<212> DNA 
<213> Homo sapiens 
<400> 42 



ccccgtcaca ggtacatgta ctggacagac tggggtgaga cgccccggat tgagcgggca 60 

gggatggatg gcagcacccg gaagatcatt gtggactcgg acatttactg gcccaatgga 120 

ctgaccatcg acctggagga gcagaagctc tactgggctg acgccaagct cagcttcatc 180 
caccgtgcca acctggacgg ctcgttccgg taggtaccca c 2 21 



<210> 43 
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<211> 221 
<212> DNA 
<213> Homo sapiens 
<400> 43 



tccctgactg caggcagaag gtggtggagg gcagcctgac gcaccccttc gccctgacgc 60 

tctccgggga cactctgtac tggacagact ggcagacccg ctccatccat gcctgcaaca 120 
agcgcactgg ggggaagagg aaggagatcc tgagtgccct atactcaccc atggacatcc 180 
aggtgctgag ccaggagcgg cagccttttt gtgagtgccg g 221 



<210> 44 
<211> 156 
<212> DNA 
< 213 > Homo sapiens 



<400> 44 

tttctcagtc cacactcgct gtgaggagga caatggcggc tggtcccacc tgtgcctgct 60 
gtccccaagc gagccttttt acacatgcgc ctgccccacg ggtgtgcaga tgcaggacaa 120 
cggcaggacg tgtaaggcag gtgaggcggt gggacg 156 



<210> 45 
<211> 416 
<212> DNA 
<213> Homo sapiens 
<400 > 45 

ctccacagga gccgaggagg tgctgctgct ggcccggcgg acggacctac ggaggatctc 60 
gctggacacg ccggacttca ccgacatcgt gctgcaggtg gacgacatcc ggcacgccat 120 
tgccatcgac tacgacccgc tagagggcta tgtctactgg acagatgacg aggtgcgggc 180 
catccgcagg gcgtacctgg acgggtctgg ggcgcagacg ctggtcaaca ccgagatcaa 240 
cgaccccgat ggcatcgcgg tcgactgggt ggcccgaaac ctctactgga ccgacacggg 300 
cacggaccgc atcgaggtga cgcgcctcaa cggcacctcc cgcaagatcc tggtgtcgga 360 
ggacctggac gagccccgag ccatcgcact gcaccccgtg atggggtaag acgggc 416 



WO 01/92891 



238 



PCT/US01/16946 



<210> 46 
<211> 198 
<212> DNA 
<213 > Homo sapiens 
<400> 46 

ttcttctcca gcctcatgta ctggacagac tggggagaga accctaaaat cgagtgtgcc 60 
aacttggatg ggcaggagcg gcgtgtgctg gtcaatgcct ccctcgggtg gcccaacggc 120 
ctggccctgg acctgcagga ggggaagctc tactggggag acgccaagac agacaagatc 180 
gaggtgaggc tcctgtgg 198 



<210> 47 
<211> 244 
<212> DNA 
< 213 > Homo sapiens 
<400 > 47 

ccgtcctgca ggtgatcaat gttgatggga cgaagaggcg gaccctcctg gaggacaagc 60 
tcccgcacat tttcgggttc acgctgctgg gggacttcat ctactggact gactggcagc 120 
gccgcagcat cgagcgggtg cacaaggtca aggccagccg ggacgtcatc attgaccagc 180 
tgcccgacct gatggggctc aaagctgtga atgtggccaa ggtcgtcggt gagtccgggg 240 



<210> 48 
<211> 313 
<212> DNA 
<213> Homo sapiens 
<400 > 48 

gttcgcttcc aggaaccaac ccgtgtgcgg acaggaacgg ggggtgcagc cacctgtgct 60 
tctgcacacc ccacgcaacc cggtgtggct gccccatcgg cctggagctg ctgagtgaca 120 
tgaagacctg catcgtgcct gaggcctttt tggtcttcac cagcagagcc gccatccaca 180 
ggatctccct cgagaccaat aacaacgacg tggccatccc gctcacgggc gtcaaggagg 240 
cctcagccct ggactttgat gtgtccaaca accacatcta ctggacagac gtcagcctga 300 
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aggtagcgtg ggc 



<210> 49 
<211> 255 
<212> DNA 
<213> Homo sapiens 
<400> 49 

cctgctgcca gaccatcagc cgcgccttca tgaacgggag ctcggtggag cacgtggtgg 60 
agtttggcct tgactacccc gagggcatgg ccgttgactg gatgggcaag aacctctact 120 
gggccgacac tgggaccaac agaatcgaag tggcgcggct ggacgggcag ttccggcaag 180 
tcctcgtgtg gagggacttg gacaacccga ggtcgctggc cctggatccc accaaggggt 240 
aagtgtttgc ctgtc 255 



<210> 50 
<211> 210 
<212> DNA 
<213> Homo sapiens 
<400> 50 

gtgccttcca gctacatcta ctggaccgag tggggcggca agccgaggat cgtgcgggcc 60 
ttcatggacg ggaccaactg catgacgctg gtggacaagg tgggccgggc caacgacctc 120 
accattgact acgctgacca gcgcctctac tggaccgacc tggacaccaa catgatcgag 180 
tcgtccaaca tgctgggtga gggccgggct 210 



<210> 51 
<211> 352 
<212> DNA 
<213> Homo sapiens 
<400 > 51 

gtgttcatgc aggtcaggag cgggtcgtga ttgccgacga tctcccgcac ccgttcggtc 60 
tgacgcagta cagcgattat atctactgga cagactggaa tctgcacagc attgagcggg 120 
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ccgacaagac tagcggccgg aaccgcaccc tcatccaggg ccacctggac ttcgtgatgg 180 
acatcctggt gttccactcc tcccgccagg atggcctcaa tgactgtatg cacaacaacg 240 
ggcagtgtgg gcagctgtgc cttgccatcc ccggcggcca ccgctgcggc tgcgcptcac 300 
actacaccct ggaccccagc agccgcaact gcagccgtaa gtgcctcatg gt 352 



<210> 52 
<211> 225 
<212> DNA 
<213> Homo sapiens 



<400 > 52 

gcctcctcta cgcccaccac cttcttgctg ttcagccaga aatctgccat cagtcggatg 60 
atcccggacg accagcacag cccggatctc atcctgcccc tgcatggact gaggaacgtc 120 
aaagccatcg actatgaccc actggacaag ttcatctact gggtggatgg gcgccagaac 180 
atcaagcgag ccaaggacga cgggacccag gcaggtgccc tgtgg 225 



<210> 53 
<211> 235 
<212> DNA 
<213> Homo sapiens 



<400 > 53 

ctttgtctta cagccctttg ttttgacctc tctgagccaa ggccaaaacc cagacaggca 60 
gccccacgac ctcagcatcg acatctacag ccggacactg ttctggacgt gcgaggccac 120 
caataccatc aacgtccaca ggctgagcgg ggaagccatg ggggtggtgc tgcgtgggga 180 
ccgcgacaag cccagggcca tcgtcgtcaa cgcggagcga gggtaggagg ccaac 235 



<210> 54 
<211> 218 
<212> DNA 
< 213 > Homo sapiens 



<400 > 54 
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ccaccctccc gcaggtacct gtacttcacc aacatgcagg accgggcagc caagatcgaa 60 

cgcgcagccc tggacggcac cgagcgcgag gtcctcttca ccaccggcct catccgccct 120 

gtggccctgg tggtggacaa cacactgggc aagctgttct gggtggacgc ggacctgaag 1 80 
cgcattgaga gctgtgacct gtcaggtacg cgccccgg 218 



<210> 55 
<211> 234 
<212> DNA 
<213> Homo sapiens 
<400> 55 

ggctgcttgc aggggccaac cgcctgaccc tggaggacgc caacatcgtg cagcctctgg 60 
gcctgaccat ccttggcaag catctctact ggatcgaccg ccagcagcag atgatcgagc 120 
gtgtggagaa gaccaccggg gacaagcgga ctcgcatcca gggccgtgtc gcccacctca 180 
ctggcatcca tgcagtggag gaagtcagcc tggaggagtt ctgtacgtgg gggc 234 



<210> 56 
<211> 157 
<212> DNA 
<213 > Homo sapiens 
<400> 56 

ttgtctttgc agcagcccac ccatgtgccc gtgacaatgg tggctgctcc cacatctgta 60 
ttgccaaggg tgatgggaca ccacggtgct catgcccagt ccacctcgtg ctcctgcaga 120 
acctgctgac ctgtggaggt aggtgtgacc taggtgc 157 



<210> 57 
<211> 272 
<212> DNA 
<213 > Homo sapiens. 
<400> 57 

gttctcctct gtccctcccc cagagccgcc cacctgctcc ccggaccagt ttgcatgtgc 60 
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cacaggggag atcgactgta tccccggggc ctggcgctgt gacggctttc ccgagtgcga 120 

tgaccagagc gacgaggagg gctgccccgt gtgctccgcc gcccagttcc cctgcgcgcg . 180 

gggtcagtgt gtggacctgc gcctgcgctg cgacggcgag gcagactgtc aggaccgctc 240 
agacgaggtg gactgtgacg gtgaggccct cc 272 



<210> 58 
<211> 134 
<212> DNA 
<213> Homo sapiens 
<400> 58 

tctccttgca gccatctgcc tgcccaacca gttccggtgt gcgagcggcc agtgtgtcct 60 
catcaaacag cagtgcgact ccttccccga ctgtatcgac ggctccgacg agctcatgtg 120 
tggtgagcca gctt 134 



<210> 59 
<211> 274 
<212> DNA 
< 213 > Homo sapiens 
<400> 59 

gtttgtctct ggcagaaatc accaagccgc cctcagacga cagcccggcc cacagcagtg 60 
ccatcgggcc cgtcattggc atcatcctct ctctcttcgt catgggtggt gtctattttg 120 
tgtgccagcg cgtggtgtgc cagcgctatg cgggggccaa cgggcccttc ccgcacgagt 180 
atgtcagcgg gaccccgcac gtgcccctca atttcatagc cccgggcggt tcccagcatg 240 
gccccttcac aggtaaggag cctgagatat ggaa 274 



<210> 60 
<211> 164 
<212> DNA 
<213> Homo sapiens 



<400> 60 
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cttccctgcc aggcatcgca tgcggaaagt ccatgatgag ctccgtgagc ctgatggggg 60 
gccggggcgg ggtgcccctc tacgaccgga accacgtcac aggggcctcg tccagcagct 
cgtccagcac gaaggccacg ctgtacccgc cggtgagggg cggg 164 



<210> 61 
<211> 130 
<212> DNA 
<213> Homo sapiens 

<400 > 61 „ 
ttggctctcc tcagatcctg aacccgccgc cctccccggc cacggacccc tccctgtaca 60 
acatggacat gttctactct tcaaacattc cggccactgc gagaccgtac aggtaggaca 120 
tcccctgcag 



<210> 62 
<211> 496 
<212> DNA 
<213> Homo sapiens 

<400> 62 <n 
tcaaacattc cggccactgc gagaccgtac aggccctaca tcattcgagg aatggcgccc eo 

= 180 



ccgacgacgc cctgcagcac cgacgtgtgt gacagcgact acagcgccag ccgctggaag 120 
gccagcaagt actacctgga tttgaactcg gactcagacc cctatccacc cccacccacg 181 
ccccacagcc agtacctgtc ggcggaggac agctgcccgc cctcgcccgc caccgagagg 240 



agctacttcc atctcttccc gccccctccg tccccctgca cggactcatc ctgacctcgg 30U 
ccgggccact ctggcttctc tgtgcccctg taaatagttt taaatatgaa caaagaaaaa 360 
aatatatttt atgatttaaa aaataaatat aattgggatt ttaaaaacat gagaaatgtg 420 
aactgtgatg gggtgggcag ggctgggaga actttgtaca gtggagaaat atttataaac 480 
ttaattttgt aaaaca ^96 
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